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Preface 



We give a review of the most important results on optimal tomography as math- 
ematical wave-pattern recognition theory emerged in the 70 's in connection with 
the problems of optimal estimation and hypothesis testing in quantum theory. 
The key problems in this theory is mathematical analysis and synthesis of the 
optimal dynamic analyzer discriminating between a given, discrete or continu- 
ous, family of pure or mixed a priori unknown wave patterns. Classical pattern 
recognition theory as a part of classical mathematical statistics cannot tackle 
such problems since it operates with given sample data and is not concerned 
how these date should be obtained from the physical wave states in an optimal 
way. In quantum theory this problem is sometimes referred as the problem of 
optimal measurement of an unknown quantum state, and is the main problem 
of the emerging mathematical theory of quantum statistics. 

We develop the results of optimal quantum measurement theory, most of 
which belong to VPB |ni-|22i further into the direction of wave, rather than 
particle statistical estimation and hypothesis testing theory, with the aim to 
include not only quantum matter waves but also classical wave patterns like op- 
tical and acoustic waves. We apply the developed methods of this new pattern 
recognition theory to the problems of mathematical design of optimal wave an- 
alyzer discriminating the visual and sound patterns. We conclude that Hilbert 
space and operator methods developed in quantum theory are equally useful 
in the classical wave theory, as soon as the possible observations are restricted 
to only intensity distributions of waves, i.e. when the wave states are not the 
allowed observables, as they are not the observables of individual particles in 
the quantum theory. We will show that all characteristic attributes of quan- 
tum theory such as complementarity, entanglement or Heisenberg uncertainty 
relations are also attributes of the generalized wave pattern recognition theory. 
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Introduction 



The problem of automatic recognition of coherent wave patterns is coming to the 
forefront in connection with one of the most important problems in building fifth 
generation computational systems, that are either of classical wave input and 
output or of quantum nature. The problem of efficiency for such wave pattern 
analyzers contains a number of mathematical subproblems, the most important 
of which have been studied in the seventeen's in connection with the quantum 
estimation and hypothesis testing theory. Without going into details of this and 
related quantum measurement theory these problems can be explained as the 
problems of (mathematical) design of an optimal analyzers of classical optical 
or acoustic waves, the devices occupying the center of the stage in the problem 
of automatic recognition of visual or sound patterns. An example of such a 
device is given in |45j : a receiver of acoustic waves v(x — ct) whose idealized 
model is a point-like resonator (or cavity) capable of measuring the intensities 
of the vibrational modes excited by the waves. The modes are the natural 
vibrations of one or several standards placed at point a; = and are described 
by an orthonormal set of functions Xfe(i) on a given interval of observation 
[0,T]. A typical example of such a resonator is the spectrum analyzer, a device 
that measures the intensity distribution over the discrete frequencies fk = k/T , 
k ^ N, and can be represented by a selective filter of harmonic waves 

Vk{x — ct) — 2 Re (f)'^ exp{27rjfc(a; — ct)/cT}, j = \/— 1, 

in the output of which one can measure the positive numbers z^*^ — \(t>'^\'^ de- 
termined by the complex- valued amplitudes (j)'^ G C^. The spectrum selector 
described by the harmonic functions 

Xfe(t) = exp{-27rj/ct/T}, A:-0,1,... 

which form an orthonormal set {x^} with respect to the scalar product 

Jo 

is ideally suited for the discrimination of pure tones with multiple frequencies 
{/i}, tones described by disjoint complex-valued amplitudes ip'l — aX i k 
corresponding to the harmonic waves 

Vi{x — ct) = 2'Re(p^{t — x/c), 
vii 
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where ^Piit) = J^kLo ^iXki^)- To establish which of the tones in {ip^} with dif- 
ferent frequencies and nonzero intensities = ^ at i = fc is actually 
detected by such a receiver, it is sufficient to find the number i of the excited 
standard tuned to one of the harmonic modes in {xj} corresponding to the set 
{<Pj} : Xiit) = 'Piit)/ WfiW- The vibrational energy of such a standard will coin- 
cide with the intensity \{xi \ y'JP = llv'ill^ of the detected signal ip^, while the 
other standards remain unexcited: {xk \ fi) = at k ^ i. A segment of human 
speech of duration T containing a finished sentence consists, however, not of a 
single pure tone but, generally, of an infinitude of pure tones of different ampli- 
tudes and frequencies (the frequencies may be assumed to be multiples of 1/T 
at a fixed interval T of a single reception act). Unharmonic signals, described 
by spectral amplitudes ip^ — [4>i]'^Q or, in the temporal representation, by the 
analytic signals 

oo 
fc=0 

may be indistinguishable in spectral measurements, even if they are orthogonal. 
For example, if the (p^it), i — 1, . . . ,m are disjoint pulses obtained through the 
shift by T of the pulsed signal (pgit) of length At = T/m, these pulses have cor- 
responding to them the orthogonal spectral amplitudes ip'^ = 0q exp{27rjifc/m} 
with the same intensity distributions u'^ = p = \<t)Q\'^, i = 1, . . . ,to, k G N. 

Orthogonal sound signals {ipi} on [0, T] may be identified in a similar manner 
by selective filters matched with the signal modes Xi(0 = / WfiW that mea- 
sure the intensity distribution v'' = {xk I 'Pi)\'^ in the modes {Xi}, a distribution 
that has a different form for different values of i, namely, i^^ — \\(p^\\'^ S^. How- 
ever, different sentences in human speech correspond ordinarily to nonorthogo- 
nal sound signals fiit), which from the viewpoint of their meaning are identified 
if they are coUinear, that is, differ only in the total energy \\(Pi\\ = J \ipi(t)\'^dt. 
For the recognition of nonorthogonal signals {</?i}i" i one cannot employ matched 
filtration since the filters described by the nonorthogonal modes Xi = Vi/ ll'y'ill 
are noncommutative and, hence, cannot be matched in a single selector; oth- 
erwise, the total measured intensity J2T=i \iXk I Xi)P; would exceed the total 
energy \\pi\\'^ of the received signal ip^ if {ip^ \ ipf.) 7^ at least for one k ^ i. 
Thus, we have an indefinite situation, formally similar to the incompatibility 
of noncommutative quantum mechanical observables, a situation arising from 
Bohr's complementarity principle 23 and Heisenberg's uncertainty relation ji6 . 
Typical examples of noncommutative filters are the frequency and temporal fil- 
ters, which are incompatible, just as position and momentum measurements are 
incompatible in a quantum mechanical system. So which of the disjoint selec- 
tors described by orthonormal sets of modes {Xk} niust we employ to discern 
the nonorthogonal sound signals from a given set {(^S;}? It is natural to look 
for the answer to this nontrivial question in the form of a solution to an opti- 
mization problem by selecting a criterion of discernment quality such that the 
optimal selector does not depend on the gauge transformation ip^ Xp.^ for 
every complex- valued A. The latter condition is satisfied by the criterion of the 



ix 



maximum of tiie total intensity X^I^i \iXi I of the true received amplitudes 
or by the criterion of the minimum of the lost intensity J2i^k K^i I 
corresponding extremal problem for arbitrary nonorthogonal amplitudes {ip^.} 
describing quantum mechanical states normalized to prior probabilities was first 
studied in the general form in J3] , J^l , QE| • Particular solution of this prob- 
lem for the case of two nonorthogonal amplitudes {ipQ, ipi} were obtained in [2], 
|34) . while the case of several linearly independent amplitudes {(fii} was also 
considered in ^H], [ST]. 

The above-noted analogy between optimal recognition of sound signals and 
discernment of quantum mechanical states suggested the possibility of construct- 
ing a wave theory of noncommutative measurements within the scope of which 
one could solve more general problems of testing wave hypotheses for estimat- 
ing the wave parameters. From the formal viewpoint this theory generalizes the 
quantum theory of optimal measurements, hypothesis testing, and estimation 
of parameters PD], while actually it carries the mathematical methods and 
ideas developed in 0-1221 i'^to new, practically more realistic, field of ap- 
plications. A short author's review of optimal processing of quantum signals is 
given in JSj. (Additional literature on the quantum theory of detection, hy- 
pothesis testing, and parameter estimation can be found in the references cited 
inlSH, IMI.) 

In the present text we give a systematic description of the wave theory of 
representation and measurement based on analogies with quantum mechanics. 
This theory is then employed to solve the problems of detection, discrimination, 
identification, and estimation of the parameters of sound signals and visual 
patterns within the framework of the noncommutative theory of testing wave 
hypotheses developed here. The idea of employing the methods of quantum 
mechanics for discerning wave patterns emerged at the beginning of the 1970s, 
when a seminar on quantum mechanics and pattern recognition was organized 
in the Physics Department of Moscow State University. The seminar was di- 
rected by V.P. Maslov, along with Yu.P. Pyt'ev, and the author, V.P. Belavkin, 
attended it. Interest in wave tomography as reconstruction of the complete wave 
field rather than only the energy illumination of the image in a certain plane 
was stimulated by the rapid development of holography, which was invented by 
Gabor -25^, and then underwent a revival 24' when coherent sources of light, 
or lasers, were created. The emerging optimization problems of discerning wave 
fronts are similar to the problems of discerning quantum mechanical states and 
cannot be solved by classical methods of pattern recognition since it is im- 
possible to register directly and exactly the phase and amplitude of a wave field 
by measuring the energy parameters. A detailed study of these problems at 
the time showed that the then existing quasiclassical methods of solving 
quantum mechanical problems were also inadequate, and the solution had to 
be postponed until a consistent noncommutative theory of measurements was 
developed in the then rapidly advancing field of quantum theory. 

Some particular problems of optimal processing optical wave signals, such 
as those of optical localization 021) detection, and discrimination of two signals 
from closely positioned sources of coherent radiation , ^21 , have been already 
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well studied by methods of quantum statistical and nonlinear optics |26j . |41| . 

ID 

In this book, in addition to discussing the noncommutative theory of mea- 
surements common for quantum states and classical sound signals and coherent 
optical waves, we provide solutions to a number of classical wave recognition 
problems from the quantum optimal measurement theory (obtained by the first 
author, V.P. Belavkin, in the 70's, when solving similar problems for quantum 
signals). Content and commentaries to the list of literature are presented in 
brief summaries at the beginning of each section. Similar summaries are given 
at the beginning of each subsection. The subsections are written as a series 
of articles of increasing complexity so that each can be read independently, al- 
though the best way to understand the material is to carefully read the articles 
in the order given. 



Chapter 1 



Representation and 
Measurement of Waves 

In this section we discuss the mathematical apparatus of the wave theory of 
representation and measurement of sound and visual patterns. Along with pure 
wave patterns, which are described by coherent signals and fields, we also con- 
sider the representation of mixed patterns, which are described by partially 
coherent and incoherent signals and fields. In addition to the spatial- frequency 
(coordinate) and wave-temporal (momentum) representations we introduce the 
joint canonical representation, in which the pure and mixed patterns are de- 
scribed by entire functions and kernels in a phase complex space. We develop 
the mathematical theory of idea filters and quasifilters, disjoint selectors, and 
quasiselectors, which describe ordinary, successive, and indirect measurements 
of wave-pattern intensity distributions. Using this theory as a basis, we analyze 
coordinate and momenta measurements as well as quasimeasurements of joint 
coordinate- momentum distributions. The mathematical tools used here are in 
many respects similar to those used in the quantum theory of representations 
and measurements |42J, which recently received a new impetus in connection 
with problems of quantum states recognition CHIi Elli 001 j 

however, we give a wave rather than a statistical interpretation of the apparatus 
in accordance with the application considered here. 

1.1 Mathematical Description of Wave Patterns 

In this section we describe three basic types of representation of pure and mixed 
wave patterns; the coordinate, or spatial-frequency, the momentum, or wave- 
temporal, and the canonical, in which the wave patterns are represented by 
holomorphic amplitudes in the complex coordinate-momentum plane. The third 
representation, which emerged in quantum optics 26 , proves useful in an anal- 
ysis of the frequency-temporal structure of sound and visual patterns and in 
holography in an analysis of the spatial-temporal structure of such patterns. 
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1.1.1 Wave Sound and Visual Patterns 

The sound and visual patterns considered here are commonly described by wave 
amplitudes v{t,q) that are real- valued function of time t and coordinates q in 
a spatial- temporal region accessible for measurement. Although the simplest 
wave equations describing the behaviour of such physical fields arc linear in 
amplitudes v{t,q), for purposes of measurement of sound and visual patterns 
the most interesting are functions quadratic in v that describe the distribution of 
sound on the standards of the dynamic analyzer or of light on photodctectors, a 
distribution that in spatial-temporal measurements is described by the intensity 
function v^{t,q}/2. More useful information is proved not by the intensities of 
a sound or visual pattern at points {t, q) but by the distribution of the sound 
intensity at frequency / (a spatial- frequency distribution), as is common in an 
analysis of colour patterns. Such a distribution is determined by the intensity 
function 

^(/,q) = b(/,q)l', />0, (1.1) 

with if the complex-valued spectral amplitudes, 

/oo 
z;(t,q)e2-j/*dt, 
-CSO 

used to represent the wave field v{t,q) in the form of a linear combination of 
harmonic oscillations at each point q: 

t;(i,q) = 2Re/ ^(/, q)e-2-J/* d/. 
Jo 

Bearing in mind the well-known advantage of employing complex-valued am- 
plitudes, in what follows we consider complex-valued signals (f{x) = ip{f, q) = 
i^{q), with X = f{f, q) = q, assuming that in a given spatial- frequency region of 
measurements Q they possess a finite total intensity 

I{<p)= [ \<f{q)\'dq. (1.2) 
Jn 

When registering sound signals for which the spatial region of measurements is 
much smaller than the characteristic length of the sound wave, we can take for 
O a one-dimensional region, which is usually determined by a positive-frequency 
pass band <I> G of the dynamic analyzer, assuming that x = f at the point of 
its localization q = 0; recognizing visual patterns usually requires only a three- 
dimensional region O = $ x 5, where $ is the optical frequency band and S the 
surface on which the pattern is localized; for static patterns x = q (t = 0). 

From the standpoint of physics the admissible amplitudes are smooth am- 
plitudes <p(g), q gCI, with compact supports inside a (rf-|- l)-dimensional region 
rt C R''+i or, if n is noncompact, amplitudes that fall off rapidly at infinity 
together with all their derivatives. Such amplitudes generate a Hilbert space 
H = L^(f2) of amplitudes xil) of finite intensity ||x||^ < oo: 

{^\X)= I ¥'(<?)*X(«) dg. (1.3) 
Jo. 
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Generally the set of basic amplitudes (p can form an arbitrary complex- values 
space with a positive Hermitian form I{ip) — {ip \ tp) that is invariant with 
respect to complex conjugation (p* (q) = p{q)*- This form defines a finite inten- 
sity, 7^ 0, for any nonzero tp. We denote the completion of this space in 
norm \\ip\\ — /^/^((/?) by Ti and consider it to be a Hilbert space equipped with 
an isometric involution X* with respect to the scalar produce H1.3|l . which 
is linear in the second argument ip G 7i. 

The use of complex-valued amplitudes not only considerably simplifies the 
formulas for calculating the observed distributions of the fields but also makes it 
possible to employ analogies from quantum theory. Specifically, as in quantum 
theory, complex-valued amplitudes differing in a phase factor must be assumed 
"equal" since they lead to the same intensities defined by Hermitian forms of p. 
Note that in the quantum description of optical and sound signals we usually 
take the mean number of the corresponding quanta (photons and phonons) as 
the intensity functions for light and sound, respectively. These quantities are 
determined by the same Hermitian forms of the complex-valued amplitudes p 
as in the classical mode of description, provided that the quantum mechanical 
states are coherent [211, 03 1 that is, are described by Poisson probability ampli- 
tudes \(p>). Thus, restricting ourselves to intensity measurements, we postulate 
that only distributions of quanta are observable, while the only characteristics 
of signals p of interest to physics are those obtained as a result of measurement 
of such distributions. 



1.1.2 Momentum Representation 

In problems dealing with the recognition of moving patterns what may be of in- 
terest is not the spatial-frequency intensity distribution (|l.l(l but the momentum- 
temporal distribution described by the function 

T(i,p) = |^(i,p)|2, (<,p)eM'^+i, (1.4) 

where ^(i,p) is the involution Fourier transform, 

^(t,p) = JJ ^(/,q)*e2-Kt/+P q)d/dq. (1.5) 

We introduce the notation x — {t, p) = p. The representation of amplitudes 
ip in terms of the functions ^{x) — ^{t,p) = p{p) is called the momentum 
representation.^ Note that this representation differs from the common one by 
complex conjugation, ^* = but this difference is "unobservable" from the 
viewpoint of measuring intensities described by Hermitian forms 1)1. l|l and H1.4|l . 
which arc invariant under such conjugation. 
Allowing for Plancherel's equality 

/ mP)\'dp^ J \ip{q)\^dq, (1.6) 

^Thanks to the introduction of the involution ip ip* in transformation II. 5i . the inverse 
transformation to the coordinate representation is carried out by the same formula 11.51 ) , 
ip = ip, which can be extended onto generalized amplitudes x in the standard manner. 
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we find that the total intensity described by the distribution function (|1.4|l 
coincides with the total intensity (|1.2() for any amphtude ip with support in f2: 

T{p)dp = I{cp)^ J i{q)dq. (1.7) 

In what foUows we consider the values of L{q) and 7{p) in Hl.l|l and (|1.4|) as 
being functions of tp, denoting by T((/3, g) and 7(93, p) the functionals connected 
by involutions T((y5,p) ~ t-i^jP) SLn.d7{ip,q) = L{ip,q), respectively. Note that the 
Fourier transformation t(g) >—>T(p) of the Hermitian functional t{^,q) — \'~p{c[)\^ 
cannot be reduced to the Fourier transformation of its value (.(<^) as a function 
of q. More than that, measuring the spatial distribution T(q) for a single value of 
ip does not generally make it possible in any way to calculate the corresponding 
distribution T(p), and vice versa. Nevertheless, these distributions satisfy certain 
relationships, the simplest of which are H1.7|l and the inequality 



{p - pfT{p) dpj{q~ q)\{q) dq > j^l\p) (1.8) 

(where p = J pL{p) dp/I{ip) and q = J q^iq) dq/I{ip) for each of the components 
p = pk and q ^ qk, k ^ 0, . . . , d), which is known as the uncertainty relation. 
Using the commutation relations 

(27rj)^iT (1.9) 

For each pair of operators g,p in the p-representation, with p = p — p and 
q = d/d{2iT]p) = q, we can easily arrive at ()1.8|l as a corollary of Schwarz's 
inequality 

llp^ll >\ipv\m> IM25^ I 9^)1 - 27 ii^ii' • (1-10) 

Indeed, according to definition (|1.4|) we have 

{P - pf'i^ip) dp^ j \{p~ p)p{p)\^ dp = IIp^II^ 

and, similarly, allowing for (|1.6() , for the Fourier transform pp of the function 
(g — q)'f{q) we obtain from (|l.l|l 

J {q-tfi^{q)dq = j\{q-q)'p{q)\^ dq = \\qp\f . 

Thus, inequality H1.8|l is equivalent to (|1.1Q(I . where ||(^||^ — I{p>)- For nonzero 
amplitudes Lp this inequality is usually written as 

CTpCT, > l/47r, (1.11) 

where Up and Uq are the standard deviations, 



„2 / -^2 



{p-pfL{p)dp/I{p), 



= iq-q)hiq)dq/I{p), (1.12) 
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of momentum p and coordinate q in the wave packet ip from their mean values 
p and q. In this form Hl.ll|l is similar to the quantum mechanical Heisenberg 
uncertainty relation; however, here the standard deviations (I1.12|l have no sta- 
tistical meaning but characterize the extent to which the intensity distributions 
are localized in the coordinate and momentum spaces. The lower bound l/(47r) 
in this relation is achieved only in the case of an unbounded region = M''+^ 
for the Gaussian amplitudes 

^(g) = C, exp{27rj(<z - |)g - ^^}- (1-13) 

These amplitudes, with the normalization constants Cq — \ j (2Tia'^^^'^^^^ I , have 
a similar form in the p-representation: 

V{V) = C'pexp{27rj(p- ^j} }, 

with Cp = l/(27rCTp)'^''+^^/'' and a^Oq = l/(47r), are called standard canonical 
(Poisson) amplitudes and are denoted by = W), with 

" = T^K— +J — ) 

if Gq and Cp are fixed. Note that for a ^ a' such amplitudes are nonorthogonal: 

(a I a') = exp{-i |af + - 1 |a|'}, (1.14) 
with a'a^ defined as the scalar product 'Y^'l^Q(y*a'^. 

1.1.3 Mixed Signals 

Due to limits in present-day technology, only a fraction of the information on 
the intensity distribution of amplitude ip in this or another region can usually 
be obtained when analyzing sound and visual patterns. For instance, in sound 
pattern recognition the common method is to use only the frequency or temporal 
distribution obtain through integration 

= / l¥'(/,q)l' dq, ~L{t) = j dp (1.15) 

of distributions Hl.l|l and (|1.4|l over the spatial or wave region of measurement. 
In visual pattern recognition often only black and white patterns are considered. 
These are obtained as the result of mixing 

^(q) = j l¥'(/,q)l'd/, 7:(p) = j m.v)? At (i.ie) 

of the appropriate colour patterns in the spatial or wave region of measurement. 
To obtain such incomplete distributions there is no need to provide a total 
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description of the signal by amplitude ip{f, q). For instance, in describing sound 
it is sufficient to specify only the Hermitian kernel 

SifJ)^ J ^(/',q)^*(/,q)dg, 

for which S{fJ) andl(t) = S{t,t), where 

S{t',t)^ J e2-j(/*'-/'*)5(/',/)d/'d/. 

Monochrome patterns are defined by a similar kernel ^(q'jq), with t(q) — 
S'(q, q) and T(p) — 5(p, p). Having in mind the possibility of such mixing, we 
will describe signals in an abridged manner by nonnegative definite operators 
of intensity density, 5*, with kernels S{q, q') that have a nonzero trace 

TvS= [ S{q,q)dq= {S,I) , (1.17) 

which determines the total intensity, t(5) — {S,I), of the signal in fl. To each 
amplitude ip{q) we assign a one-dimensional operator S — |V-')(V'I with a kernel 

S{ci,q)^i,{4)r{q). (1.18) 

which defines the amplitude -0(9) to within a nonessential phase factor e^^. 
The diagonal values t(g) — S{q, q) describe the coordinate distribution of the 
intensity of such a signal, while the momentum distribution is described by the 
diagonal values T(p) = S{p,p) of the involution Fourier transform 

Sip',p)= y"e2'^j(^''«'-«'P''5(q',q)dq'dq. (1.19) 

Each such kernel can be obtained as a result of mixing 

S{q\q)^ J ^Pjq'WJqHda) (1.20) 

of the one-dimensional kernels corresponding to the normalized amplitudes {ipa}, 
llV'all = Ij parameterized by a space A with a nonzero positive measure v whose 
mass determines the total intensity {S,I) — i^iA). For example, the kernel 
S'(q', q) corresponding to a monochrome pattern generated by amplitude (p{ f, q) 
can be written in the form H1.20|l for 

V'/(q) = ^(/,q)A'/'(/) 

on the set A of frequencies $ equipped with a nonzero measure i^{df) = i{f) df. 

In the case of an arbitrary Hilbert space 7i, mixed signals are described by 
density operators S obtained as a result of weak integration 



S= / IVJ(ValKda) 



(1.21) 
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of one-dimensional density operators = \ip){'4'\, 

|^)(V'|:xe7^^V(V'lx) = (V'lx)V', 



(1.22) 



corresponding to the normalized values ip„ € 7i, a e A, of the vector function 
a i—> V'q • The operators (|1.21(l are kernel-positive and have a finite trace Tr 5 = 
i^(A), with each operator S : H H being represented in the form p. 21(1 via, 
say, the spectral decomposition 



5 = 



(1.23) 



Here {"0^} is the maximal orthogonal set of normalized eigenvectors tp^ £ Ti 
corresponding to zero eigenvalues Ui : Sip^ = i^iV'jj which determine the trace 
Tr5 = Y.^v^■ 

1.1.4 Gaussian Signals 

As an example let us consider the important class of mixed canonical signals 
defined by the integration 



5 = 



d+l 



(1.24) 



la), which in the 



of canonical projectors corresponding to the amplitudes t}} 

coordinate representation have the following general Gaussian form (c.f. (|1.13|) ) 

^^^(g) = Cexp{2^j(g - \S,W - W - 0^(9 - (1.25) 

Here w = w''" is a symmetric complex-valued {d + l)-hy-{d + 1) matrix with a 
positive definite real part 

w + cj* = 27r(utw)-i, 

^ — •\/2/7r Re av and -q ~ ■\/2/7rReja'''t;, with v — v'^uj/{2tt)^ are {d + 1)- 
dimensional rows, (J and rf^ are the corresponding columns, 

|Cp =det(utw)-i = \v\-^ 

is the normalization constant, and v*"^ — v'^ — v"^* is the Hermitian conjugate 
of matrix v. 

Let C = (^, if) be a 2(cZ-|- l)-dimensional row and v{dC,) — v{dS^ drf) a Gaussian 
measure on A = M^(''+-'^) normalized to certain number J < oo (the Gaussian 
intensity) and described (for J positive) by the following moments: 



-^jiv{dC)^\ 1] ^ J TJI^idC) 



= .r 



(1.26) 



(1.27) 
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where a^^ 



is a nonnegative define 2((i+ l)-by-2((i+ 1) matrix. 

The signals that correspond to such a density operator H1.24II are characterized 
by the following first moments: 



(1.28) 



qfq = 












qTp = 




f {Q'^c 


Pyj^MdC) 




jv'^v/(2Tr), 


pTq = 




f (PT^^ 








pfp = 






p^p^HdC) 




-(^3 + 5*)-! 



(1.29) 



where CD/ (27r) = 27r/cj*, Q and P are the rows of operators of position Qk 
and momentum Pk defined in the ^-representation via multiplication by qk and 
differentiation with respect to qk, or {2iT])~^d / dqk, and we have allowed for the 
fact that 



= [v\]v) + {v\v)/2^) 



(1.30) 



for the "shifted" operators q = Q — Til and p = P — pi. 

It can easily be demonstrated that for a nonsingular Gaussian measure de- 
scribed by density 71(C) ~ ^^('^O/dC of the form 



(1.31) 



(with 9 — {>c,X) and C — J / ^ydet2^^a^) corresponding to a nonsingular cor- 
relation matrix cr^ ,j we can select representation H1.24II of the density operator 
S by appropriate choice of matrix to in such a manner that the representation 
will be described by density p. 31(1 with the matrix (7i^(^ of the form 



c^CC = -Rc [(w*,jw)'l's(w*,ju)] 



(1.32) 



where s is a complex- valued positive definite (d -I- l)-hy-{d + 1) matrix. At 
this point it is expedient to introduce a complex- valued normal representation 
characterized by the transition made from 2{d+ 1) real variables ( = {£,,ri) to a 
{d + l)-dimcnsional complex variables 

a = -{^uj + 2nir]), 

7 
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where 7 = {uj + uj*y^'^ in terms of which the density H1.31|l combined with H1.32|l 
can be written in the following form: 

n(a, a*) = Cexp{-(a - eys-^{a - Oy}, (1.33) 

where 6 = {klo + 27rjA)7^^, and C = J/ dots if density (|1.33|l is normalized 
to J with respect to da da* — d^ dyy. Note that, as in the case with (|1.13|l . 
amplitudes H1.25|l must be written in the form 

\a){q) = (7/V2^)i//2gxp{(q7-Rea)at - Iqujq^] , (1.34) 

with the scalar product defined in l|1.14|l . However, in contrast to (I1.13|) . these 
amplitudes do not realize at Im lu ^ the lower bound in the uncertainty relation 
(|1.11|) while they do realize a more exact lower bound defined by the matrix 
inequality 

> 0, or CTpp > apqcr'^agp, (1.35) 
provided that matrix aqq is nonsingular. Here 

(Jpq = (Jr,i + '^*7" V (27rj), (Jqp = (J^r, - t^7" V (SttJ) 

are the elements of the correlation matrices, 



CTqq 


= 








Cqp 


= 




ptpa)'^ (da) . 




O'pq 






qipjf {da) , 


(1.36) 


^pp 






pipa)'^ (da) , 





which are defined for nonmixed canonical signals in (|1.30(l and satisfy, obviously, 
the following relation: 

'Tq{^p-p*ay)aq = l/{47Ty, (1.37) 

with CTp = (Tpp, (7g = aqq, and p — oj~^ Imw; at p = this relation realizes the 
bound of Hl.ll|l . Otherwise, it realizes the bound to (jl.35|l . 

1.1.5 Canonical Representations 

In problems dealing with the recognition of complicated sound and visual pat- 
terns, the most important information is usually contained in the momentum 
representation as well as in the coordinate representation. For example, in ana- 
lyzing speech not only the frequency distribution of its intensity is important but 



det 



'qq 

Tpq 



'qp 

^pp 
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so is its temporal distribution, in the same way as in colour pattern recognition 
it has proved important to know the wave structure in addition to the spatial 
structure, although there can be no joint coordinate-momentum representation 
that would enable calculating such distributions simultaneously (due to noncom- 
mutativity of position and momentum operators), the simultaneous estimate, 
say by the human ear, of the frequency and temporal structures of sound points 
to the possibility of building a mathematical model of such perception, which 
may prove extremely important for automatic speech recognition. 

The simplest models of such joint coordinate-momentum representations are 
those whose densities are defined as the intensities 

fc(z) = |(^J^)|2, z=(a;,y)eM2(d+l)^ (^33) 

of projections of amplitude (p on the canonical amplitudes H1.24|l at C, ^ z, which 
are parametrized at a fixed uj by the estimate vectors ^ ^ x and rj — y of the 
generalized coordinates x — (xq, . . . , Xd) and momenta y = (yg, ■ • ■ , Vd)- Wc can 
directly verify that the density in x has the form 

mix) = / k{x,y)dy = \v\-^^^ f e-^^^="^'>''''^' \ip{q)\^dq, (1.39) 



which in the limit of \v\ = Vdetvhj 00 coincides with the coordinate distri- 
bution (,(g) = |cp((7)p. Similarly, in the p-representation we find the density in 

y- 



i^iy) = I k{x,y)dx 

|-i/2 f e-"l(2'-f)^"'l'|y5(p)|2dp. (1.40) 



= V 



which in the limit of \v\ 00 coincides with the momentum distribution i(p) 
|^(p)p. Here for every amplitude if we have 

~ / in{x)dx — / / k{x,y)dxdy 



m{y) dy ^ \\ip\\- , (1.41) 

which means that the set of canonical amplitudes {i/j^ \ z € M^'^} is complete 
for every uj, with Keuj positive. Thus, the 2{d+ l)-parametric set {4^^} forms a 
nonorthogonal base that defines for each cu a canonical representation in which 
the diagonal elements of the kernel {tp^ \ S1P2) of ^ signal described by density 
operator S yield the density of the distribution of the signal's intensity l{S) = 
{S,I). For mixed canonical signals H1.14|l has the form 

kix, y) = J exp{- |c - a|'}n(e, r,) d^, dry, (1.42) 
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where c = {xuj + 2'Kiy)v^ specifically, for Gaussian signals 11.25|l we arrive 
at the following Gaussian density: 

k{z) = Jeyip {-i(z - 9)a-^{z - 0y} /y/det 27rcr^^, (1.43) 

where 

= acc + 2(u^j?^) + (u*,j?5^)A (1-44) 

is a (d + l)-by-((i + 1) correlation matrix (as usually j = V-T). 

A remarkable property of canonical representations is the possibility of calcu- 
lating the intensity distribution in any representation knowing only one canon- 
ical distribution by analytically continuing the density /e(c, c*) = k{x,y) to a 
kernel 

k{c',c*) = {c\S\c') (1.45) 

that is holomorphic in c' and c* G C'*^^, where |c) — 'i'z ^'^d similarly, |c') = ifj^, 
at 

c' = {x'uj + 27rjy')wV\/27r 

for z' = {x',y'). For one thing, for operator (|1.24l) with a density of the form 
H1.33(l we obtain 

k{c',c*) = J{c I c')e-<^'''-'">^'+'^"^^'~'">" /\S.\ . (1.46) 

The transition from kernels of the (|1.45|l type to, say, the coordinate represen- 
tation of operator S is carried out by the following formula: 

5 = J \c){c' \k{c',c*)ic\c')dcdc*dc'dc*', (1.47) 

where |c)(c'| : ip i— s- (c'|(/?)|c) are one-dimensional operators defined by the 
canonical amplitudes H1.24|l . respectively, at a = c and c' G C'+^j dcdc* = 
dxdy, dc' dc*' = dx' dy' , 

(c|c') = exp{-i|c|2 + cV-i|c'|2}. (1.48) 

Note that the canonical kernels H1.45|l . as operators S, act in the space of entire 
functions 

/i(c) =el^l'/2(<p I c), (1.49) 

which define the representation of amplitudes x G ''^ in the Bargmann space of 
all entire functions on C^^^, for which 

J |/i(c)|^exp|- |c|V2} dcdc* < oo. (1.50) 

Specifically, one-dimensional operators of density S — \ip)(ip\ have kernels 

kic',c*) = {c\v)iv\c'). (1.51) 



O'yx 



'yy 
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1.2 Mathematical Models of Wave Pattern An- 
alyzers 

In this section we will consecutively introduce and describe mathematical models 
of an ideal filter, a quasifilter, a disjoint selector, and a quasiselector that make 
it possible to move to arbitrary representations necessary for solution of the 
problems of best wave-pattern recognition based on measurements of pattern 
intensities in a single representation. We will also discuss a dilation theory, 
based on the work done by Halmos [211 and Neumark for designing ideal 
filters and selectors and their realization via indirect measurements, an idea that 
originated in quantum theory |46| . 

1.2.1 Ideal Filters 

The simplest measurement of a signal is the determination of the intensity of the 
oscillations in the signal in a given mode described by a vector ip normalized to 
unity, \\tp\\ = 1, belonging to the Hilbert space Ti of amplitudes (p admissible at 
the "in" terminals of the receiver and having a finite intensity /((/?) = {if \ if) < 
00. The oscillation amplitude in mode ^ is determined by the projection {tp \ ip) 
of the received amplitude p on direction -0, while the intensity is calculated 
according to the formula 

E^{^) ^ \ \ p) ^ \{p \ (1.52) 

similar to the transition amplitude of a quantum mechanical system from state 
(f to state Ip. The intensity given by formula H1.52|l is a positive quantity, just 
as probability is; however, it can assume values greater than unity (but not 
greater than the total intensity I{(p)). The appropriate measuring device acts 
as an ideal filter if it receives a signal ip completely, provided that ip and ip are 
coUinear, and does not receive p if ip and ip are orthogonal. A mixed signal 
described by a density operator S excites in mode -0 oscillations of intensity 

e^iS)^{S,E^,) = ii;\SiP). (1.53) 

More general analyzers carry out the measurement of the intensity 

E{p) = {p I E^) = (1.54) 

of the projection Ep> of the received signal on an arbitrary subspace of Ti. de- 
scribed by an orthoprojector E = E* E. For example, an audio frequency filter 
with a pass band A is determined by an orthoprojector E = /(A) on the 
subspace of amplitudes V'(/) with support A = <i> acting as the operator of mul- 
tiplication by the indicator 1(/, A) of set A. In a similar manner one can define 
spatial optical filters that cut out a visual field in a certain region of aperture 
A. 

The reader will recall that an orthoprojector is any linear operator E :TL ^ 
Ti, satisfying the condition E* = E = E^ , and to each Hilbert space £ Q Ti 
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there corresponds a unique orthoprojcctor for which £ = ETi.. Bearing in mind 
this one-to-one relation, we wiU describe such analyzers by orthoprojectors and 
call them ideal filters, that is, as noted earlier, filters that pass a signal without 
distortion while measuring its intensity if 95 G ETi. and that do not pass a 
signal if it is orthogonal to space EH. The set of all such filters is partially 
ordered with a smallest element and a greatest element for which we take the 
null operator and the identity operator /; specifically, filter A is stronger than 
filter B, or A> B, if orthoprojcctor B is greater than A, or AB = A. Maximal 
filters, with the exception of the zero filter, are described by one-dimensional 
orthoprojectors E-^ = \x){x\ acting according to the formula 

\x)ix\v = Xixlv) = {X\V>)X (1-55) 

and defining the normalized vector x to within a phase factor e^^. For every 
filter A there is a unique complementary filter A-^ such that A + A^ = I, with 
A^^ = A, and A-'- > B^ if B > A. The orthogonal complement A ^ A-'- 
possesses all the properties of logical negation: if A "passes" f : A^p = tp, then 
A"*- does not: A-^ip = 0, and vice versa, with A A A^ = and A V A-^ = I with 
respect to the operations of conjunction A and disjunction V, for which we take 
the upper and lower bounds 

AaB = sup{A,B}, AaB = iid{A,b}, (1.56) 

and with the duality formula being valid, or {AW B)^ = A-^ V B^. 

Generally, filters A and B arc said to be disjoint if A _L i?, that is, A(p = ip 
implies Bip = (Bcp ^ (p ^ A(p — 0), or, which is equivalent, if A-^ > B 
{B-^ > A); they are said to be incompatible ii A A B = 0, that is, if there is 
not a single signal that can pass completely through filter A and filter B in the 
sense that Aip = ip and B(p = ip. 

It can easily be shown that disjoint filters are incompatible, but not the other 
way round. For this reason the logic of filters is nondistributive, similar to the 
logic of quantum theory, which is also nondistributive. It satisfies a weaker 
condition of orthomodularity 

{A\J B^) AC = A\J {B^ AC)\i A<B <C. (1.57) 

It is in this respect that the logic of filters differs from Boolean logic, where 
incompatibility and disjointness mean the same. The intensity of a mixed signal 
S measured by an ideal filter E can be calculated via the formula 

e{S) = {S,E)=Tr:{ES). (1.58) 
1.2.2 Disjoint Selectors 

Complicated analyzers measure the intensities i?i((^) = |(Xi I (p) P of the received 
field ip simultaneously in several standard modes Xj € H, i = 1, . . . ,m, which. 
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if the normalization conditions ||Xil| = 1 for all i's, are necessarily orthogonal 
in view of the condition 



i=l 

Otherwise, the total received intensity '^^i Ei(Lp) could be greater than the 
total intensity I{(p) = of the received signal (p. Such analyzers act as 

disjoint selectors, or ideal selective filters that split the received signal f into 
orthogonal components ip^^ = {xi \ f)Xi-i * = l,...,m. Signal ip is received 
completely by a selective filter if Eip = ip, where E = Y^^i Ei is the appropriate 
nonselective filter defined by the one-dimensional orthoprojectors Ei on the 
subspaces generated by the standard modes Xi- 

More general selective filters are specified by arbitrary sets (or families) 
{Ei I i = l,...,m} of projectors Ei : Ti ^ H that satisfy the condition of 
pairwise orthogonality EiEk = for i ^ k. For example, a disjoint selector that 
measures the intensity of a signal in each region Aj of a Borel partition 

O = ^ Ai : Aj C O 

i 

is described by an orthogonal set of Ei = I{Ai) of indicators /(Aj) = {l(g, Aj)}. 
Note that such a set {Ei} may have an infinite number of members if space H is 
not finitc-dimcnsional: in this case the received intensity is determined for each 
(p by an absolutely convergent series ^iif) ^ where 

Eiiip) = I Ei,p) = \\Ei,pf . (1.59) 

A selective; measurement is said to be complete if the inequality E{p) < I{(p) 
is transformed into an equality for every (p £ H, that is, if J2^i = -^i 
a strong operator topology (7 is the identity operator in H) and is said to be 
maximal if all the Ei are one-dimensional. 

Complete filters are usually related to self-adjoint operators with a nondegen- 
erate discrete spectrum {xj} through the spectral decomposition (or expansion) 

oo 

A = ^XiEi, (1.60) 

with each set {Ei} being assigned a numbering self-adjoint operator N = 
EiiEi. 

In addition to discrete filters there is another important class of filters, known 
as continuous filters, which are related to normal operators with a continuous 
spectrum X C C^. In accordance with von Neumann's theorem, to each such 
operator there is uniquely assigned a projector- valued measure E on X that 
specifies the orthogonal expansion (or decomposition) of unity I = J E{dx), so 
that 

A = J xE{dx), V{A) = |x e H : J \x\^E{x,dx) < ooj . (1.61) 
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Here a family {Aj \ j = 1, . . . ,n} of pairwise commutative normal operators 
Aj : Ti ^ Ti has corresponding to it a selective filter described by a projector- 
valued measure E = {S)j=i on X C C™ that defines a spectral representa- 
tion A = j xE{dx) for the vector operator A = {Aj). It is with these vector 
selective filters that the measurement of the intensity distribution l|l.l|l in the 
coordinate region is carried out. Such a distribution is described by the or- 
thogonal decomposition of unity I = J I{dx) for the coordinate vector operator 
Q = {Qk, fc = 0, 1, . . . , d); the coordinates in the coordinate (or position) repre- 
sentation are given by the respective operator of multiplication hy q — (qk), so 
that 

/(A)v?(9) = l((7,A)^((7), (1.62) 

where 1(A) is the indicator of the Borel subset A C W^^^ : l{A,q) = 1 for 
g e A and l(g, A) = for g ^ A. The result of such a measurement is the 
continuous measure I{ip,dx) with a density 

L{ip, x) = I[ip, dx)/dx = \ip[x)\^ . (1.63) 

Note that the self-adjoint position operator 

Q = j ql{dq), V{Q) = |x e H : J q^\x{q)\^dq < (»| (1-64) 

has a domain of definition X'(Q) coinciding with H = L^{Q) only in the case of 
a bounded region O. Generally speaking, operator Q is only a densely definite 
operator, such as the frequency operator F = if I{df) in the case of a 
semi-infinite band ^ — [0, oo[ of the spectrum. 

In general, let X be an arbitrary set and B{X) the Borel algebra of its 
subsets. Every measure E : A G B i-^ E{A) with values in the orthoprojectors 
of the Hilbert space TC is said to be a disjoint selector, it is called a complete 
selector if E{X) = I. Disjoint selectors measure the intensity distribution in 
the received signal tp on X according to the formula 

Eiip, A) = (^ I EiA)ip) = \\EiAM' , (1.65) 

and define for each ip a positive measure on X of finite mass E{ip,X) < I{ip), 
coinciding with I{ip) = \\(p\\ in the case of a complete selector. 

We will say that selector E' on X' majorizes selector E on X (denoted 
E' > E) if there exists a measurable mapping f : X' ^ X with respect to 
which 

E{A) ^ E'{f-\A)) for every A e 6(X) (1.66) 

where f~^{A) = {x' e X' \ f{x') G A} is the inverse image of set A C X, and 
the selectors E' and E are equivalent, E' ~ E, if E' < E, too. 

1.2.3 Successive Filters and Quasifilters 

The common practice in processing sound and visual patterns is to use analyzers 
that act not in the initial Hilbert space Ti generated by the amplitudes ip on 
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the "in" terminals but in an extension of this space. For example, temporal 
measurements of sound signals y(/) with a restricted frequency band $ are 
reduced to determining the intensity of these signals in this or that temporal 
interval A g via the orthoprojector /(A) of multiplication of the signal in 
the temporal representation by the indicator function l(t,A): 

I{ip,A)= [ l(i,A)|^(t)|\l<= I{A)^ ' (1.67) 



where /(A) acts in the space of signals of unlimited bandwidth, L^(M). In a 
similar manner space Ti is extended to i^(R'^) in the wave processing of optical 
fields observed on a limited aperture S C , which results in determining the 
intensities of the fields in this or another momentum interval A G M'*. 

In general, such an extension is described by an isometric embedding F : 
Ti, ^ Ti' oi Hilbert space Ti into another space H' , for which for examples 
considered here we can take the Hilbert space Ti' = L'^{M.'^^-^) into which the 
space H — L^(57) is isometrically embedded via the Fourier transform F : 
(fi i-^ (fi^ . An ideal filter described in Ti' by the orthoprojector E measures the 
intensity of amplitude ip d Ti. defined by the Hermitian form 

D{ip) ^ \\EF>ff = {Fkp I EF^) = (v? I D^), (1.68) 

where F) = F*EF is a positive contraction operator in Ti. Formula H1.68|l 
shows that this intensity can be considered the result of successive action of 
two ideal filters, F and E, with Ti being identified with a subspace FTi C Ti' , 
where filter F is described by the orthoprojector F that cuts subspace Ti out 
of Ti' . For example, temporal measurement of narrow-band signals is the result 
of noncommutative action of a frequency filter F — I{Af) and a temporal filter 
E = I (At), the result is effectively described by the Hermitian form H1.68|l 
defined by the operator 

D = I{Af)I{At)I{Af). 

It is, therefore, advisable to generalize the concept of a filter by describing it in 
space Ti by any operator D : Ti ^ Ti that satisfies the condition 

I? > 0, (1.69) 

and calling it a quasifilter if D . 

The basis for this extension is the Halmos theorem [211 , according to which 
every quasifilter described by operator l|1.69|l can be considered as a reduction 
(projection) D — F*EF on Ti of an ideal filter E acting in an extension Ti'. For 
the Hilbert space Ti' we can always take the doubling Ti' ~ Ti ®Ti — €? ® Ti oi 
space Ti with embedding F : ip i-^ (<y5, 0), selecting the operators 



En = D, Ei2 - VD{I - D) = E21, E22 = I - D (1.70) 

for the blocks of orthoprojector E. Note that allowance for consecutive action 
of several noncommutative ideal filters i?i , . . . , i?„ in the initial Hilbert space 
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also leads to the notion of a quasifilter. These ideal filters then measure the 
intensity 

D{ip) = \\E,,...EM\^ = {^\D^), (1-71) 

D — El . . . En-lEnEn-l . . ■ Ei, (1-72) 

and the result can be considered the effect of linear nonideal filters not neces- 
sarily described by Hermitian contraction operators A : H ^ H, \\A\\ < 1, with 
the nonideal filters damping and distorting the amplitudes and with D ^ A* A 
in the formula for the appropriate intensity: 

D{^) = \\A^\\' = I D^). (1.73) 

1.2.4 Quasiselectors and Indirect Measurements 

In a similar manner we can introduce generalized selectors, which are defined on 
a Borel space X by a positive operator-valued measure M : A g B{X) M(A) 
specifying in the Hilbert space H a weak decomposition D = J M{dx) of an 
operator D satisfying condition p. 69(1 in the following sense: 

D{ip) = j M {ip, dx) for every ^p. (1.74) 

Here, as usual, D{(p) = {cp \ Dip) is a Hermitian form defined by the operator of 
total effect D, while 

M{(p,dx) = {(p\ M{dx)ip) (1.75) 

is the distribution of intensity of X corresponding to amplitude ip and measured 
by such a selector. Note that the expansion of operator D defined by measure 
M{dx) may not necessarily be orthogonal even if the operator is a projector, 
that is, if the total filter is ideal, such nondisjoint selective filters will be called 
quasiselective filters, or simply quasiselectors. A quasiselector is said to be 
complete if M{X) = I and maximal if M < M' M' ~ M in the same sense 
as in lfLHH|l . 

An example of a complete maximal quasiselector for sound signals and op- 
tical fields observed in a restricted region Vl of coordinates M'^+^ 3 q — {f, q) is 
the analyzer of the momentum distribution ((1.4(1 , which in the g-representation 
is defined by formula ((1.75(1 via the operator-valued measure 

M{dp) = F*I{dp)F = I{dp), (1.76) 

where F is the Fourier transform ((1.51) . and I{D) — {l{p,D)} is the projector- 
valued measure on E'*+^ described in the space H' — L^(]R'^+^) of the proper 
representation of the generalized- momentum operator p — {t, p) by the indicator 
measure l(p, dx). 

Note that the momentum operator defined in 7Y = L'^{ fl) by the nonorthog- 
onal integral expansion 

P - y pl(dp), V{P) - |x e H : j p^\x{p)?Ap < oo 
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is always unbounded with a spectrum and, for ^ R''"'"^, non-selfadjoint, 

notwithstanding the fact that the form of the total momentum P{ip) = J pi {if, dp) 
is always Hermitian. Nevertheless, this operator always uniquely defines a 
nonorthogonal expansion I = J I{dp) via the condition 

J pHx I ndp)x) = [Px I Px) Vx e V{P), 

and is a restriction to functions tp{q) = for q ^ £7 of the operator {2Tri)~^d/dq 
that is self-adjoint in L^(]R''+^) with a domain of definition 

V{P) = {ipe i2(M'i+i) : \\d^ip/dq^f < oo}. 

For instance, time operator T = j tl{dt) in the space Ti, — with a 

semi-infinite band $ = [0, cxd[ is a symmetric but not a self-adjoint operator 
{2n])~^ d / d f with a domain of definition 

V{T) = |x e X2(o, oo) : x(0) = 0, |ax(/)/a/pd/ < ooj . 

Similarly, the validity of the representation 

M{dx) = F*E{dx)F, F*F = /, (1.77) 

for an arbitrary quasiselector M in the form of the projection of the disjoint 
selector E described in the extended space Ti.' by an orthogonal projector-valued 
measure E{dx) is ensured by the Neumark theorem '47' . For mixed signals the 
intensity, as a function of the density operator S, is described by a measure 
/i(5', dx) defined by the following linear form: 

^i{S,dx) = {S,M{dx)) . (1.78) 

Quasiselective filters also emerge as a result of reducing the description of 
indirect measurement of the received signal via the disjoint selection Eo{dx) of 
the initially uncorrelated reference signal interacting with the received signal; 
this reference signal generates a Hilbert space Hq. Specifically, if 5*0 is the 
density operator of the normalized reference signal (Tr So = 1) and U is a unitary 
operator describing in the tensor product Ti.<S)Ti.o the result of the interaction 
S' = U{S (E) So)U* with the received signal S, then the intensity distribution 
corresponding to such indirect measurement may be effectively calculated via 
formula (|1.78|l as a result of the quasimeasurement 

{S,M{dx)) = {S',I(g) So{dx)) = {S(8)So,E'{dx)) 

described by the operator-valued measure 



M{dx) = Tr [{I (g) Sa)E'{dx)\n] , 



(1.79) 
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where E'{dx) = U*{I (g) Eo{dx))U, and Tr {-IH} is the partial trace inH'gj'HQ 
defined for factorable density operators S ^ Sq via the formula 



The indirect calculation of the intensity distribution over the frequency (colour) 
/ € $ of static monochrome patterns is an example of the above-mentioned 
type of measurement. It can be carried out as a result of the wave processing 
of the patterns in which the intensity distribution over the momenta p in such 
patterns is calculated. 

Using the Neumark theorem as a basis, let us give an explicit description 
of a construction that makes it possible to reduce any quasimeasurement to 
an indirect measurement. To this end we take for Tio the space Ti.' of the 
Neumark construction and for the reference signal a normalized amplitude ip' = 
Ftp, WtpW = 1, and introduce the linear operator (7 in 7i (g) Ti' (which is defined 
by the Neumark isometry F : H ^ H', F*F = I) in the following manner: 



with the generating elements being (p ^ (p' , (p ^ H, (p' E H' . It can be directly 
verified that U = U* and = I and, hence, U*U ^ I ^ UU*. Taking for 
Eo{dx) the Neumark expansion E{dx) in H', assuming that 5*0 — and 
allowing for H1.77|l , we arrive at an indirect measurement whose reduction H1.79|l 
yields the initial measure M{dx): 

Ty{{I ®\i}')(tP'\)E'{dx)} = (V- I Tp)F*E{dx)F = M{dx). 

1.2.5 Canonical Operators and Measurements 

Bearing in mind the invariance of the domains of definitions of operators Q and 
P with respect to the self-adjoint operators of multiplication by q and differen- 
tiation with respect to q, or (27rj)~^ d/dq, which onH — L^(ri) coincide with Q 
and P, respectively, in what follows we will take for Q and P in L^(R'^"'"^) their 
extensions, while always assuming that the region where these operators act is 
n C R*^"^^. Such operators are known as canonical and satisfy the commutation 
relations (|1.9|) in the common domain 2?(P) n ^{Q). 

Let us now discuss simultaneous measurement of the coordinate (or position) 
and momentum distributions. In view of the noncommutativity of Q and P, 
there can be no joint orthogonal decomposition of unity for these operators; 
there can even be no joint nonorthogonal decomposition of AI{dxdy) for which 
the following would be true 



Otherwise, in view of the Neumark theorem, there would be commutative self- 
adjoint operators in Ti' Z) Ti coinciding on Ti. with the noncommutative opera- 
tors Q and P, which is impossible. 



U : ip^ip' ^ F*ip' iSi Fip + p iSi {I ~ FF*)p' 



(1.80) 




(1.81) 
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Another interesting question is the relation to these operators of the mea- 
surements of the canonical distributions (|1.38() . Such canonical measurements 
are described, obviously, by continuous with respect to dz = dx dy nonorthog- 
onal measures K{dz) ~ k{z)dz with projector-valued densities 

fc(z) = |VJ(V'.l = |c)(c|^fc(c,c*), (1.82) 

which are defined by canonical amplitudes (|1.25l) at C = ^ and a certain uj or, 
in complex variables, by (| 1.34(1 at a = c. The respective quasiselective filters, 
which are parameterized by symmetric co matrices with a nonsingular real part 
Lo + Lu* and now will be called canonical filters, are, obviously, maximal and, 
because of (|1.41ll . complete: 

|VJ(^.|dz-/ = II |c)(c)dcdc*, (1.83) 

where dcdc* — dxdy. 

By fixing co and directly integrating we find that the quasimeasurement of 
an intensity distribution in a; g M''+^ is described by a continuous measure 



M{dx) = / K{dxdy) = ■m{x)dx 
diagonal in the g-representation, 

m{x) ^ [ k{x,y)dy = \v\~^ I e-^l(^-9)^"'l'/(dg). (1.84) 



with a Gaussian density and v*v = 2ti{uj + a;*)^^, while the quasimeasurement 
of^an intensity distribution in y G R*^^^ is described by an operator measure 
M{dy) = J K{dxdy) = m{y)dy, where 

m{y) = I k{x, y)dx = \v\-^ ^ e-"'(^-f)""''7(dp), (1.85) 
with a Gaussian density and 

v^v = 27r(cD + cD*)"\ 

where 2/27r = 27: /lj* . The operator measures M and M on R''+\ which define 
nonorthogonal expansions of operators Q and P, that is, 

Q = y 2;m(x)da; and P = I yrh{y)dy, (1.86) 

describe, in contrast to the spectral measures / and /, inaccurate measurements 
of position and momentum distributions, which are obtained by smoothing out 
(|1.39|l and l|1.40|l with Gaussian weighting functions m and m. Nevertheless, 
the canonical operator measure K that generates the two spectral measures 
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possesses certain spectral properties with respect to the complex-valued combi- 
nations of the two respective operators: 

A = ^={Quj + 2niP)v'' . (1.87) 

Namely, applying A directly to the canonical amplitudes H1.34|l . we can easily 
verify that it is well-defined on these amplitudes: 

A\a) ^ a\a), a £ C'^+\ (1.88) 

which, therefore, form a proper base for A in TC = L^(M''"'"^). 

Hermitian conjugate operators C = A* are diagonal in the Bargmann rep- 
resentation, 

idh)ic) = el^l'/2(Cx|c) = el^l'/2^(x|c) = c/i(c), 
with a domain of definition I?(c) = {x G ?i : ||c7i|| < oo}, where 

\\chf^ J |c|'/i(c)|2e"l"l'dcdc*, (1.89) 

on which domain there is also defined the operator A by differentiation (ah) (c) = 
dh{c) / dc. Thus, in the initial representation we obtain the nonorthogonal " spec- 
tral" decompositions 



A = y cK{dz), A* ^ j c*K{dz) 

V{A*) - |xeH:y|cn(x|c)2dcdc*<oo|. 

Now let us describe a simple realization of a canonical measurement by an 
indirect measurement defined in the tensor product 7i(8)7ioi where TLo is a copy 
of Ti. To this end we take the commutative self-adjoint operators 

X = Q®Io + I®Qq,Y = P®h-I®PQ, (1.90) 

where Qo = lo-, Po = (27rj)^^9/(?(7o, and Iq is the identity operator in TIq = 
^2^]grf+i^^ Suppose that E{dz) is the orthogonal spectral measure of the set 
Z = {X, Y) and that 

V'o(9o) = l^^^^r/^expj-igo^qT} ^ |o)p (i.gi) 

is the basic canonical amplitude in Tip- We take an arbitrary amplitude x G ^ 
and the corresponding tensor product 

(x ® ■4^o){q, Qo) = x(9)V'5(9o) 

in 7i©7io and define the characteristic function of the corresponding distribution 
thus: 

T{u,u*) = J ej("*^^+^*"^'(x® I E{dz)x®ro), (1-92) 
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where u, u* € C^'^'^"'""'^^ and as usual, 

c = {xuj + 27rjy)w^. 

\/2tt 

We write this function in terms of normal operators 

S= I cE{dz)^^={Xuj + 2TTiY)v^ ^A(E)Io + I<»Co, (1.93) 
with A the operators p.87|) and 

Co = ^{Qoiv - 2TTiPo)v\ 

in the form 

where we have allowed for the property CqV'o = fo'' the basic amplitude (|1.91|) . 
Employing now the completeness property (|1.83() of canonical amplitudes, we 
obtain 

T{u,u*) = /(ej^*"'x|c)(c|ej^*"'x)dcdc* 

ej"'"'+j"*"'|(x|c)pdcdc* = fc(u,w*). 

Thus, the characteristic function p.92|l of the indirect measurement of intensity 
of amplitude x G 7i coincides for the ground state "00 j when calculated in the 
z-representation of operators (|1.9U|) . with the characteristic function k of the 
canonical distribution fc(c, c*) = |(x | c)p for this amplitude. 



Chapter 2 



Optimal Wave Detection 
and Discrimination 



In this chapter we develop the wave theory of hypothesis testing for solving 
problems of optimal recognition of sound and visual patterns. We formulate 
the necessary and sufficient conditions for the optimality of two-alternative and 
multialternative detection of wave patterns according to the maximum criterion 
for the measured intensity of acoustic signals and optical fields. We consider 
problems involving the discrimination of a wave pattern against an acoustic or 
optical background, problems involving the discrimination of pure nonorthogo- 
nal signals and fields, and problems involving the recognition of mixed patterns 
described by noncommutative density operators. Complete solution of the last 
type of problem is then obtained for the case of mixing two pure patterns. The 
discussed results of solution of the corresponding extremal problems follow from 
the methods of linear programming in Banach partially ordered operator spaces 
|43) . The results generalize the corresponding results of the quantum detection 
and estimation theory, which have been obtained for the two-alternative case by 
Helstrom ISO! and for the multialternative case by Belavkin ^1] , ^21 • The nec- 
essary and sufficient optimality conditions for the quantum theory of hypothesis 
testing have been discussed by Kennedy 001) Yuen and Lax Kholevo 
Belavkin ^3], and Belavkin and Vancjan |1(J| . 



2.1 Optimal Detection of Sound and Visual Pat- 
terns 

In this section we will discuss the problem of detecting wave patterns that are in 
a partially coherent superposition with an acoustic or optical background. The 
problem is complicated by the presence of interference. We start by considering 
the superposition principle for generalized mixed amplitudes. We then formulate 
the necessary and sufficient conditions for the optimality of detection and give 
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solutions to a number of problems considered in the quantum case in the review 

m 

2.1.1 The Superposition Principle 

The problem of detecting a sound or visual pattern described by a wave ampli- 
tude (p{q) taken from the Hilbert space H — L^(r2) can be solved in a trivial 
manner by measuring the total intensity I{f) = ||v|| only in the absence of 
an acoustic or optical background consisting of other signals and fields in the 
frequency-spatial region Q considered. If in the region of measurement there is 
another signal or field described by amplitude (/Jq G 7i the question of whether a 
wave pattern ip is present cannot generally be unambiguously solved by simply 
measuring the total intensity of the resulting amplitude ^p, which may be higher 
or lower than the background intensity. Such a phenomenon is called interfer- 
ence and is the result of the wave superposition principle tp + (p + (p^, according 
to which the complex-valued amplitudes of the coherent signals and (^q rather 
than the intensities of these signals, are added. The intensity of the resulting 
signal has the form 

||^||' = ||^||V2Re(^|^o) + ||^o|^ (2.1) 

To describe the result of the superposition of a mixed pattern and a partially 
coherent background caused, say, by thermal fluctuations that have an infinite 
total intensity of the acoustic or optical field, we can employ the correlation 
theory by considering generalized random amplitudes within the second-order 
statistical theory. 

Partially coherent signals and fields determined in a similar manner in the 
framework of the classical or the quantum theory are commonly described by 
bounded operators F from Ti. to another Hilbert space /C; for ordinary nonran- 
dom amplitudes ip G Ti. these operators are usually represented by the functional 
-FVx = (V' I x)j denoted by = {tp\ and acting from Ti. to JC = C The mean 
intensity of random oscillations excited in mode x G Ti, ||x|| — 1, is determined 
by a Hermitian form in F: 

E^{F) = {Fx I Fx) = (x I F*Fx), (2.2) 

and is calculated for common mixed signals via formula H1.53|l with the aid of 
a (generally infinite trace) density operator P — F*F oi the intensity t(P) £ 
[0, oo], where F* is the Hermitian conjugate operator K, H acting for F ^ {ip\ 
as an operator of multiplication c i—^ ipc from K = <C to TI. The intensity (|2.2|l 
is a measurable quantity bounded by the norm of the positive operator P and 
equal, via the duality theorem, to 

SxiP) = ix I PX) < \\P\\ = inf{e I el > P}, (2.3) 

which for the case of white noise P = el, described by the isometric operator 
T — F/y^, rp*rp _ determines the local intensity e — e^(P), the same for all 
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modes x ^T^- Note that every partially coherent signal F can be considered as 
the result of action of a contraction filter D = P/ \\P\\ on white noise of local 
intensity e = ||P|| if we employ the polar decomposition F = TP^^^, which 
determines uniquely the isometry operator T on the range of values F*JC of 
operator F* . 

For generalized signals with infinite trace density operators P it proves expe- 
dient, however, to consider only such quasimeasurements for which the operators 
D of the total effect lead to finite intensities: 

D{F) = Tr (FDF*) = Tr (PD). (2.4) 

In addition to one-dimensional projector D = E^ = |x)(xli for which the in- 
tensity H2.2() is determined by the bounded form 12. 311 . we can always consider 
finite-dimensional operators D — |Xi)(Xil ^-s well as any trace class operator 
0<D<I, since Tr (SB) < eTr D ii S < el. 

Extending the superposition principle to generalized amplitudes F, Fq : 7i — > 
/C, we find that the result G — F + Fq of addition of the generalized signal F 
and the background Fq is described by a density operator R = G*G, that is the 
sum of operators P = F*F and Pq = Fo*i^o only if ReF*Fo = 0. The latter 
condition, which defines the incoherence relation between F and Fq, cannot be 
met for nonrandom amplitudes F = (ip\ and Fq — (<Pq| since F*Fo = \ip){iPq \ ^ 
even in the event of orthogonality ((p | (pg) = if (/3 or t/jg ^ 0, although the 
total intensity l|2.1|l is equal to the sum ||(^||^ -I- ||(/3ol|^• 
Generally, the resulting density operator R can be represented in the form 

i? = P + pi/^^Pg/a ^ pV2^*pi/2 ^ (2.5) 

where C = T*Tq is the operator of mutual coherence of signal F = TF^^^ and 
noise Po = TqP^^^ determined by the partial isometrics T : F*JC Ti and 

Note that C is a contracting operator: 

||C||<||T*|| llTolHl, 

and a partially isometric operator if FqH. C FH. The latter condition deter- 
mines the coherence relation between the generalized amplitude Fq and ampli- 
tude F, which is always met for nonrandom amplitudes F = {ip\ and Fq = ((/JqI 
for which FqT-C = C = FH. Representing the partially coherent amplitude Fq 

1/2 

in the form of a sum of the component Hq = TCPq coherent with P and 
the component W = Fq — Hq that is incoherent and doing the same with the 
resulting amplitude G, or 

G^Hi + W, 

where Hi — F + Hq, we can isolate from operators Pq and R a common density 
operator of the incoherent background N — W*W by writing the two operators, 
with allowance made for the fact that W*Hi = 0, in the form Pg = S'g + iV and 
R=Si+N, where 5, = H*Hi are the operators Sq = Po/^C*CPo/^ and 

Si=P + P^'^USy^ + Sl'^U*D^/^ + Sq, (2.6) 
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with U the partially isometric operator of polar expansion, and CPq = USq . 
In contrast to Pq and R, for a trace class operator P the operators Si are usually 
also trace class operators of rank r{Si) < r{P) and one-dimensional operators 
if r(P) = 1. 

Infinite trace operators P may also be replaced with trace class operators 
if we consider finite total intensities (|2.4|) with respect to a fixed D, assuming 
that S = D^^'^PD^/^ . The effective operator D is then replaced with the or- 
thoprojector E on the subspace £ — DH that determines the total intensity 
e{S) = Tr 5 by taking the trace e{S) = Tr (ES) on £. 

2.1.2 Classical Detection 

The simplest detection problem, that of isolating a pattern described by a kernel 
operator P > from an incoherent mixture P = P -I- of this pattern with 
the background N, is solved by measuring the intensity of one of the possible 
signals, Rq — N or Ri — R, by comparing this signal with the background level 
i{N) = TtN. To this end it has proved sufhcient to limit oneself to measuring 
the total degree of contrast l(C) = Tr C of the received signal by calculating the 
trace (C, E) = Tr (GE) of the appropriate operator Ci = Ri — N, i = 0,1, on any 
subspace £ = EH., CE = C, with the trace assuming finite values {Cq,E) = 
in the absence of a pattern, i — Q, and (Ci, E) —TrP,i — 1, in the presence of 
a pattern even for an infinitely high level of the background l{N) = oo. 

In the case of a partially coherent superposition R of pattern P and back- 
ground Pq , the difference C = R — Pa may be a nonpositive trace class operator 
with a zero or even negative trace, with the result that the detection criterion, 
which is based on the condition that the total degree of contrast t(C) is posi- 
tive, may lead to incorrect results. Even if t(C) is positive, which is the case 
when the superposition tp ^ ip^ + ip oi orthogonal amplitudes, {ip \ ip^) ~ 0, is 
coherent, that is, 

we can considerably increase the degree of contrast of amplitudes ipQ and "0 if 
we sum, say, the coordinate distribution of the degree of contrast, 

c{x) = - = \^{x)\' -f 2Re^*(x)^o(x), (2.7) 

not over the entire region Q but only that part of the region where c{x) is 
positive. As a result we arrive at the following classical problem of optimal 
detection of a pattern in a coordinate (frequency-spatial) region fl: we must 
find a measurable subregion A° C f2 in which the upper bound 

>f?(C) = sup (C,/(A)) = / c{x)dx (2.8) 

of the integral of the contrast function c{x) = C{x, x) is attained. 

This function is determined by the diagonal values of the kernel C{x',x), 
which is the difference between the generalized matrix elements R{x',x) and 
N{x' , x) of operators R and N in the coordinate representation. 
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It is sufficient to consider the supremum (|2.8|l in the class of measurable 
subsets A C of the coordinate region = {x £ E'*+^|c(a;) ^ 0}, the support 
of the integrable function c(x), in which the supremum is attained only on the 
set 

A° ^{xen\c{x)>0} = n+. (2.9) 

Its value, >c°j(C) — J^^ c{x) dx, coincides, obviously, with the integral over fl of 
the positive part 

c+{x) = max{0, c{x)} = i(c(x) + |c(a;)|), (2.10) 
where the functions c determine the solution to the duality problem 

(c)^ = inf|y" b{x)dx\b>c^=J^c+{x)dx. (2.11) 

The lower bound (|2.11|l over all positive integrable functions b{x) E L^(n), 
majorizing almost everywhere the function c, is attained at 6° = V c = c+ 
and determines on the space of integrable functions c a positive gauge (c)_j_, 
which is zero only when c < 0. The set (|2.9|l specifies the optimal band of the 
frequency-spatial filter in which the best quality of detection, (|2.8|l , is achieved. 

Reasoning along similar lines, we can solve the problem of optimal detection 
in the momentum (or temporal- wave) space X — 

K°{C) = sup (cJ{A)) = [ c{x)dx, (2.12) 

where c{x) = C{x,x) are the diagonal elements of the difference R{x',x) — 
N(x' , x) of the operators R and N in the momentum representation; in coherent 
superposition these diagonal elements are 

c{x) = mx)f ~ \^,{x)f = \^{x)f + 2Re(^*(x)i^o(a;). (2.13) 



The quality of such detection, >Cj(C) ~ y-^/ ' based on a momentum quasimea- 
surement may differ considerably from H2.11|l . For example, the canonical am- 
plitudes H1.25|l ipQ = tpQQ and t/j = i/'oiji which are similarly localized in the 
coordinate representation, differ by their momenta, 77 7^ 0, and can be thought 
of as two hypotheses, corresponding to the absence and presence of a complex- 
valued amplitude ip = ip^^ — ip^Q in the coherent superposition tp = ip + ipQ, that 
cannot be distinguished by the measurement of 

(c+ — since c{x) — for all x E il). At the same time, such wave packets are 
easily distinguished in the momentum representation: 

C) = \v\-^ [ (e-'^K^-")^"'!' - e-'^l^^"'!') dx ~ 1 
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if ^ 1 since in this case c+{x) ~ |'(/'Q^(x)p. 

In general, for every quasiselective measurement of intensity on a Borel space 
X with a positive operator measure Af (A) < I, A C X, optimal detection is 
determined by the solution to the problem 

xIj {C) = sup (C, M(A)) = >c{A°) (2.14) 

ACX 

of finding the upper bound of the degree-of-contrast measure >f(A) = (C, M{A)) . 
The supremum H2.14|l is attained on the |>ir|-measurable set A°, the support of 
the positive part 

= 0W {k+ |>f|)/2 

of measure >c: 

A° = n{A : K+{A) = 0} EE A+, (2.15) 
which realizes the lower bound in the positive measures A > >f of finite variation: 

{k)^ = inf {^(^) I A > = >c+{X), (2.16) 
which determines the gauge {x) — <^ < oi measure >c. 

2.1.3 Optimal Detection 

As the example in Section . 1 . 21 shows . the quality of detection, which is deter- 
mined for a given intensity distribution on X by the degree-of-contrast measure 
^{C,A) = (C, M(A)), must be optimized not only with respect to measure- 
ment regions A C X but also with respect to the methods of measurement of 
this quantity. These methods are determined by the ways in which the positive 
operator-valued measure M{A) < E is specified, where E is any orthoprojector 
in Ti. satisfying the condition CE = C. Here it is sufficient to find at least one 
resolving operator D — M{A) that realizes the upper bound of the maximal 
degree of contrast (|2.14|l : 

x°{C) = sup{(i:>, D)\D< E}. (2.17) 

_D>0 

Employing the methods of linear programming in partially ordered Banach 
spaces 031 J can formulate the necessary and sufficient conditions for the 
optimality of the detection operator D employing criterion (|2.17|l . which is de- 
termined by the trace class degree-of-contrast operator C = R — Pq. 

Theorem 1 The upper bound (|2.17|) is attained on operator < D° < E if 
and only if 

B°{E- D°) = {), {B°-C)D° = 0, (2.18) 
where B° > 0, C . The operator B° here is the solution to the duality 

(C)^^ M^{{B,E)\B>C} (2.19) 

for which the conditions (|2.18() for admissible D° are also necessary and suffi- 
cient, with x°{C) — (C)_|_. 
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Proof. The sufficiency of the optimaUty conditions (|2.18|) for solving problems 
H2.17I) and H2.19() can be verified directly by employing the property of the 
monotonicity of the trace, 

B>C^Tt {BD)Tt {CD), 

for every positive operator D. Allowing for the fact that B°E = B°D° = CD° 
for every < D < E, we obtain 

(C, D) = Tr {CD) < Tr {B°D) < Tr {B°E) = (C, D°) . 

Similarly, for every B > and every C we obtain 

{B, E) = Tr {BE) > Tr {BD°) > Tr {CD)° = {B°, E) . 

The necessity of the optimality conditions (|2.18() follows from the fact that the 
inequality 

(C, D) = Tr {CD) < Tr {BD) < Tr {BE) = {B, E) , (2.20) 

which is valid for all operators D and B admissible in problems (|2.17|) and 
H2.19|l . must transform into the equality 

{C,D°) = {B°,E) 

on the extremal operators D° and B°, in accordance with Lagrange's principle 
of duality: 

sup{{C,D) \ D < E} = sup M {{C,D) + {B,E - D)} 

D>0 D>0-R>0 

= m{ sup {{C - B,D) + {B,E)} 

E>0 D>Q 

= mi {{B,E)\B>C}. 

B>0 — J 

Whereby, allowing for the fact that the trace of the product of positive operators 
is zero if and only if the product itself is zero, we arrive at conditions H2.18|l via 
the following relation: 

Tr [B°{E - D°)] + Tr [{B° - c)D°] = Tr {B°E) - Tr {CD°) = 0. 

The proof of the theorem is complete. ■ 

Note that the solutions to problems H2.17|l and (|2.19|l exist for every Hermi- 
tian trace class operator C and every bounded positive orthoprojector E; the 
solution to problem (|2.17(l is unique only if E is the minimal of the orthopro- 
jectors for which CE = C, while the solution to problem H2.19|l is unique only 
if E is the maximal E = I oi the orthoprojector E. Indeed, employing the 
spectral representation of operator C, we write this operator in the form of the 
orthogonal sum 

C = Y,>Cr.\xJ{Xn\=C+ + C- (2.21) 
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of the positive and negative operators 

C+=Y1 ^n\Xn){XnlC^ = E ^"lxJ(xJ, (2-22) 

where we have aUowed for the fact that a Hermitian trace class operator has a 
discrete spectrum of finite multiplicity, Xn G M, which can be found by solving 
the eigenvalue problem Cx = >^X- The orthoprojector E satisfying condition 
CE = C can be written in the form of the orthogonal sum 

E = E+ + Eq + E_, (2.23) 

where Eq — E — E+ — E^ with 

E |XJ(X„U-- E \Xn)iXnl 
>f„>0 3<„<0 

The operators D° = E+, B° = C+ arc, obviously, admissible: < E+ < E, 
C+ > 0, C and optimal: 

C+{E-E+) = C+{Eo + E) = {), 

(C+ - C)E+ = -C-E+ = 0. (2.24) 

Every other solution D° to problem 1)2.17(1 satisfies conditions H2.18|l for B° = 

C+{E-D°) = C+-C+D° = 0, 
(C+ - C)D° = -C^D° = 0, 

in view of which E+ = E+D° and E^D° = 0, that is, 

E+ < D° < E - E_ = E+ + Eq. (2.25) 

Similarly, every solution B° to problem (|2.19|) satisfies conditions (|2.18|) for 
D° = E+: 

B°{E - E+) = 0, {B° - C)E+ = B°E+ - C+ = 0, 

which imply that B is commutative with E^ and, hence can be represented in 
the form of the orthogonal sum B° = B^ + Bq, with 

B+= B°E+ = C+, Bq{E-E+) = Q, 

that is, 

B° ^C+ + Ba, Bo > 0, BqE = 0. (2.26) 

Thus, the general solution to the problem of optimal detection is determined 
by the quasifilter 12.25|l of the form D° = E+ + Dq, where Dq is an arbitrary 
operator, < Z^o < ^^o, and an ideal fiher D° ^ E+ if E = E+ + E^. The 
general solution to the duality problem 1(2.1911 is determined by the operator of 
the form ((2.2t)() , with Bq = at E ^ I . The maximal possible degree of contrast 
realized by the optimal detector D° is given by the expression 

x+{R - Po) - Tr (i? - Po)+ = E (2.27) 
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2.1.4 Coherent and Quasioptimal Detection 

Let us consider the particular problem of optimal detection of a wave pattern 
described by a common amplitude </? G 7i in a partially coherent mixture with 
a generalized random amplitude Hq : 7Y ^ /C. The resulting amplitude 

with ^ € /C a normalized vector ||^|| = 1, defines a density operator R = G*G 
of the form 

R = \^){^\ + |^)(^o| + + Po, (2.28) 

with (/7q = Fq*^ G Ti and Pq = F^F^) the background-density operator. Thus, 
we are required to solve the extremal problem (|2.17() for the two-dimensional 
degree-of-contrast operator C — R — Pq of the form 

C = \ip){ip\ + \^)iipa\ + \^o)if\ 

which corresponds to the coherent superposition ip = ip + oi the common 
amplitudes ip and (pQ. We will consider this problem in the minimal subspace 
£ C H generated by the amplitudes tpQ = (pQ and tp-^ ~ (p^+p. For its solution we 
find the eigenvectors and eigenvalues of operator C by constructing the secular 
equation Cx = ^^X for the coefficients of the expansion 

X = aoV'o + "iV'i 
in the base {ipo, ^i} of space £: 

ipiiipi I aoipo + aiV'i) - V'o(V'o I "oV'o + "iV^i) = ^("oV'o + o^ii'i)- (2.29) 

Introducing the notation i^i = HV'ill^ i i ~ 0,1, P = {ipQ \ "01), and equating the 
coefficients of V'i, * = 0, 1, in H2.19|l . we arrive at a system of two homogeneous 
equations, 

(i/Q + x)ao + I3ai = 0, /3q;o -I- {vi - x)ai = 0. (2.30) 
This system has nonzero solutions only if the system determinant is zero, or 

+ =Q. (2.31) 

Solving this quadratic equation for k, we obtain the eigenvalues: 



-±-^±Vl^J -l/^P, (2.32) 
which are real in view of the Schwarz inequality 
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and, obviously, have opposite signs: ±k± > 0. At /3 = the amphtudes ipi 
and tpo nieasuring the degree of contrast of oscillations in the resulting mode 
= ipi/y/i^i which is equal to the intensity of oscillations at >ir = i^i in this 
mode if the received signal is ■0i and to zero if the received signal is '00 • In the 
opposite case |/3p = I'oi'i of the colinearity of ipi and ipQ, the values >f± are 
equal respectively, to the positive and negative parts of the difference z^i — i^g: 

x± = ^(^^1 - i^o ± Wi - >^o\) = {t^i - i^o)±- 

The corresponding optimal detection is reduced to the measurement of the pos- 
itive degree of contrast x+ = vi — i/q in the mode x = V'l/v^ ~ "00/%/^ if 
vi > vq, in the opposite case, vq > vi, the degree of contrast xj^ is zero and 
no measurement is carried out, or x+ — 0. The optimal detection of a wave 
pattern ip of intensity /i = ||<p||^ 7^ in the coherent superposition ^ = Lp + Lp^ 
is therefore reduced to the measurement of the maximal degree of contrast 



x+ = VM^(^Re7 + V7+V(I^e^ + V7)' + l-|7pj , (2-33) 

where 7 = ((/?o | p) / ^/T^ is the coefficient of colinearity of amplitudes ip and p^, 
and A is the signal-to-noise ratio. The corresponding ideal filter Ej^ = |x+)(x_|_| 
is defined at >f+ 7^ by the eigenvector x+ — OL+'f + 0:0 with coefficient 



a+ = ^/vo/mo (^j Im7 + V7 + \/(Re7 + VA)^ + 1 - |7|2^ 



(2.34) 



ao > 0, found from the normalization condition ||x+|| = 1- The case where 
K = 0, and therefore x+ = 0? is possible in the minimal subspace £ only if p 
and Pq are colinear, when I7I = 1, and 

>f+ = ^/JwQ{cosd + VA)+, (2.35) 

where cos6' = Re 7. The optimal filter in this case is matched with the signal 
mode x+ ~ 'fl\fp^ if cos B > —\f\ and x = in the opposite case if cos B < —^/X 
which is possible only if A < 1. 

The same filter Xo — f/y/jj^ matched with -0 is used to describe the asymp- 
totically optimal detection at large signal-to-noise ratios 

A = l/e> (Re7)2. 

The degree of contrast is then 

>fo = M(l + v^Re7), (2.36) 



which coincides with (|2.33|) to within e. In the next order we obtain a fil- 
ter matched with the resulting mode Xi = V'/V^ a-nd realizing the degree of 
contrast 

'^V 2 '4 4[H-(yj/2)Re7 + £/4]y ^ ' 
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For an orthogonal background, ip L (p^, we have 7 = 0, and the normaUzed 
eigenvector x+ corresponding to the eigenvalue 

x+ = (\/1 + A + \/a) (2.38) 

can be written in the form 

{fo/V^ + (vi + A + V\) ^/^) 



x+ = 



2(1 + A) + VA(1 + A) 



For ImA 7^ 1 and a low signal-to- noise ratio A -C 1 — (Im7)^, the maximal 
degree of contrast is realized at 

with sin^ = Im7, and is determined asyptotically by the expression 

^^7ti:^(ReA + cos6' + £A(Re7 + l)), (2.40) 

with cos 6 = \/l ~ (Im7)2. For one, at 7 = wc get = ^/i 1^0(1 + \/A). 

In the general case of the partially coherent superposition G = F + Fq, 
the solution of the eigenvalue problem for the degree-of-contrast operator C = 
G*G ~ FqFq constitutes a complicated mathematical problem. If we isolate the 
coherent component from the generalized amplitude Fq, we can represent the 
latter in the form 

Fo = ^V^FA* +W, 

with W the incoherent component, F*W = 0, and A is an operator in H, which 
we assume to be bounded: < 1. Next we select the positive constant £ in 
an appropriate manner. Operator C then assumes the form 

C = P + ^y^{PA* + AP) (2.41) 

and is a trace class operator if P = F*F is an operator with a finite trace. 

For high signal-to- noise ratios A = 1/e 1, the eigenvectors Xn ^^'^ the cor- 
responding eigenvalues >f„ of operator C can be found via perturbation theory 
methods. In the first order in the eigenvectors coincide with the eigenvec- 
tors <p„ of the signal density operator P, that is, P<Pon = MnVo?n realize the 
following degrees of contrast: 

^on = (Von I CipoJ = /u„(l + VsRe^J, (2.42) 

with 7„ = {A(p^ \ fn)- The corresponding quasioptimal detection is reduced, 
therefore, to measuring the total degree of contrast 

xo = Tr{CEo) = Yl + \/^Re7„) (2.43) 
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matched with the signal orthogonal modes ip^ of the ideal filter 

Eo ^ N+^{n:Re^„>-l/V^}. (2.44) 

When the intensities of the signal and the noise are comparable, £ « 1, the qual- 
ity of such detection may be considerably lower than that of optimal detection. 
In particular, for the above example of orthogonal ip and ipQ we have 7 = and 
xq = while the quality of optimal detection (|2.28() equal to 

m:+ = ^(1 + \^TT^) > M 

is more than twice as great as xq if the signal intensity is less than half of the 
intensity of the noise, and we have 



— = -(l + VTTe) ^00 ase= — ^00. (2.45) 
xo 2 4^ 

2.2 Multialternative Detection of Wave Patterns 

In this section we will consider the problem of detecting one of several sim- 
ple or mixed wave patterns that is in a partially coherent super-position with 
the background. We will introduce the necessary and sufficient conditions for 
the optimality of such detection, using the criterion of the maximum degree 
of contrast, and give these conditions a concrete meaning for the problem of 
separating such patterns from an incoherent background. We will also give the 
complete solution to the problem of identifying nonorthogonal waves of the same 
intensity. This solution formally coincides with that of the problem of optimal 
discrimination between pure quantum states obtained in |14| . |12| . Finally, we 
will discuss the quasioptimal method of multialternative detection based on the 
perturbation theory for degree-of-contrast operators. 

2.2.1 Statement of the Pattern Identification Problem 

The problem of m-alternative detection of sound and visual patterns described 
in a given spatial-frequency region O by the wave amplitudes ip^{q), i — . . . ,m 
belonging to the Hilbert space H — L^(ri) can be solved in a trivial manner in 
the absence of an audio or optical background, (pg = 0, only on the assumption 
that these amplitudes are pairwise orthogonal, [ip^ \ Lp^.) = for i ^ k. It is 
sufficient to measure the intensity distribution Si = \ x)p in the received 
signal -0 G over the orthogonal modes Xk = V'fe/ ll'y'fcll i * — 

to determine correctly the pattern p = p.^ with a nonzero intensity /ij = ll'/'ill^ 
by specifying the number of the excited mode i : ei = 0. The other 

modes x^,, k ^ i, remain unexcited in the process, and the case = for all 
i = 1 , . . . , TO means that these patterns are absent from the measurement region 

n. 
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The simplest problem of multialternative detection in noise, the problem of 
isolating one of a set of orthogonal amplitudes {(p^} from an incoherent mixture 

with an optical or acoustic background not necessarily described by a trace class 
density operator N, has the same solution if we compare the intensities 

£i = iXi I RXi) = M» + 

of the received signal R G {RiYiLo not with zero but with the background level 
•^i = iXi I Nx,) in the orthogonal modes Xi = fj WiW, i = l,---,m. 

In the case of nonorthogonal amplitudes {v'i}™ i there is no way of measuring 
the intensity distribution in the received signal directly over the modes tp^/ 
We must therefore find a set {Xi}iLi that satisfies the condition 

m 

Elx.)(x,l</ 

and for which a pattern ip^ can be confidently reconstructed from the distribution 
— lii^ \ Xi)P corresponding to the received signal ip S {"y^i}™!- 
The same problem emerges in the multialternative detection of patterns 
ill the coherent superposition ip^^ = (p^ + ip^ with a nonzero background 
amplitude Lp^ even when all the amplitudes {y'j}™ i are mutually orthogonal 
and orthogonal to Lp^. Although in the latter case we can still measure the 
intensities in the orthogonal signal modes Xi = "^il and this yields a total 
degree of contrast xq = X)!" i /^iJ '^6 can achieve a higher quality of detection if 
we minimize the expression 

m m 

^ = E(^^ I C.^J = E I V'JI' - I(X. I ^o)!', (2.46) 

1 = 1 2 = 1 

where 

= IV',)(V'.|-|'^o)('/'oL (2.47) 

as we did in the case with to = 1 in the example at the end of Section |2. II 

Note that in the coordinate representation the wave patterns {(Pi}^i may 
be indistinguishable even if they are orthogonal, as is the case, say, for harmonic 
amplitudes 

ip,{f) = exp{27rj/V$} 

which are orthogonal in the frequency interval [0, and have the same ho- 
mogeneous distributions, |<^i(/)P — 1- The maximal quality of m-alternative 
detection achievable through measurements of the coordinate distribution of the 
contrast degree 



c,{x) = + 2Re^*(x)^o = - |^o(x)|2 (2.48) 
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is determined by the solution to the extremal problem 

>c°i{C)^ sup V(a,/(A,)) = V / c^{x)dx, (2.49) 

where the supremum is taken over the measurable nonintersccting subsets C 
J7 of a coordinate region that can be bound by the union of the supports of 
the integrable functions Ci{x), i = 1, . . . ,to. This limit is attained, obviously, 
on the partitions 

m 
i=l 

of the measurable set in every point of which at least one of the functions 
Ci{x) is positive and coincides on A° with the upper envelope 

C\/{x) = max Ci{x). 

i—l,...,m 

Thus, the total degree of contrast H2.49|l coincides with the integral over of 
the positive part 

c+(x) — max(0, Cv(x)) 
of Cv , which determines the solution of the duality problem 

(c)+ = inf 6(2;)dx|&>c,, i = l,...,m| = J c+{x) dx. (2.50) 

The lower bound H2.50|l over all positive integrable functions b{x) > 0, which 
almost everywhere majorize every function Ci{x), defines the positive gauge of 
the vector function c = {q}™ j^, (c)^ = <^ Ci{x) < 0. The best m-alternative 
detection in the coordinate region Q is reduced, therefore, to a search among the 
Ai for the regions A° on which the measured degree of contrast c{x) is positive 
and reaches c+(a;); in the opposite case of c(x) < c+{x) the pattern may not be 
detected for all x € il+ and it can be assumed to be undetected if c{x) < for 
all X G ft. 

2.2.2 The Optimality Conditions 

Let us consider the problem of maximizing the quality of m-alternative detection 
of mixed patterns Fi : H ^ IC with trace class density operators Pi — F*Fi, 
i = 1, . . . , m, in a partially coherent superposition Gi = Fi -\- Fq for which the 
mutual densities 

f:Fo = V^P^A*/2 

are determined by the contraction operators Ai : H ^ Ti, {e > is a parameter) . 
The respective extremal problem is formulated for trace class operators C = 
(^i)ili of the degree of contrast, 



C; = R^-Po = P, + -^fe{P^A* + A,P,), z = 1, . . . , m. (2.51) 
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Ri = G*Gi, in the class of quasiselective measurements described by any resolv- 
ing operators {Di}™ 

V(Q,A>|VA<^L (2.52) 
' J 

where E is an operator for which CiE = Ci for alH = 1, . . . , m. 

Theorem 2 The upper bound H2.52() is attained on the admissible operators 
Di, i — I, . . . ,m, if and only if there is a trace class operator B° > 0, Ci, i = 
1, . . . , m, such that 

B°{E - D°) = 0, {B° - Ci)D° = 0, i = 1, . . . , m, (2.53) 

with D° = ^i- operator B° then is the solution to the duality problem 

(C>+ = ]nf {(S, E)\B>C, i^l,...,m}, (2.54) 

with >i°{C) = (C)_|_, for which conditions (|2.53|1 are also necessary and sufficient 
when D° > 0, '£T=i < E. 

Proof. The proof, which is similar to the proof of a particular case of this 
theorem, Theorem ^ will be found as a corollary of a more general theorem. 
Theorem O 

It can also be easily proved that the solution to problem H2.52|l exists for 
all trace class operators Ct and determines on subspace £ — ETC for which 
dE — d for alH = 1, . . . , m, a unique solution B° = B°E to problem H2.54|l . 

Indeed, the lower bound H2.54|l determines for the vector operators C = 
(Ci)™ ]^ be the gauge (C)_|_, a positive homogeneous sublinear functional on the 
space of families (Ci)™-^ of the trace class operators Ci_ i — l,...,m, that 
possesses the property (C)^ = <^ Q < for all z's. Bearing in mind that 
every linear functional C (C,D) satisfying the condition (C,D) < (C)_(_ is 
positive and has the form 

m 

(C,D) =^Tr(a,A), 

where X^I^li E>i = E, we find that the set that is conjugate to {C| (C)^ < l} 
consists of the resolving families D = {-Di}™ i of bound operators that are 
admissible in problem H2.52|l . The existence of a solution to problem H2.52|l 
follows, therefore, from the Hahn-Banach theorem, according to which for every 
vector C° of a calibrated space there exists a supporting functional D° defined 
by the conditions (C,D°) < (C)+ and (C°,D°) = (C°)_^. For every solution D° 
to problem l|2.52|l the solution of the conjugate problem (|2.54() on the subspace 
£ = EH. is determined uniquely by the formula 



m 

B°E = B°D° = ^ CiD° 

i=l 



(2.55) 
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which is obtained by adding H2.53|l over i = 1, . . . ,m. Note that the above proof 
of the existence of a solution to problem (|2.52|l and of the uniqueness of the 
solution to problem (|2.54|l remains valid for the case of an infinite number of 
patterns m = oo if we require that (Ci, E) = Tr C,; — > as i cx3. 

Conditions H2.58|l can easily be met for m > 1 by analogy with the case 
of TO = 1 only for commutative Ci, when these operators have a joint spectral 
representation 

m 

a = I ^ra) = Smn- (2.56) 

1=1 

The orthoprojector E can be resolved into an orthogonal sum E = -Eo+X^i^Li 
where 

= \Xn)iXnl ^^ ^ {u G N|><,;„ > {0, Hkn : k ^ l}} 

(points n at which >fi„ — maxj=i^..._m >?jn = x^fen refer to any one of the nonin- 
tersecting sets Ni, Nk)- The operators 

D° = E.„ S° = E E "^MiXnl =Y.^^E^ (2.57) 

2=1 nSNi i=l 

are, therefore, admissible and optimal: 

B°{E - D°) = B°Eo - 0, {B° - C,)E, = C\E, - C,E, = 0. 

Thus, optimal TO-alternative detection in the commutative case is reduced to 
measuring the discrete distribution of the degree of contrast x in the proper 
representation of operators Ci. The total maximal degree of contrast in this 
case is determined from the formula 

>^°(C) = E E = E Inax{>^„ V 0}. (2.58) 

i=l n&li rtSN 



2.2.3 Solution of Optimal Identification 

Let us consider the important case of positive operators C,; — Hi = Si 
which occur, say, in the case of an incoherent superposition of wave patterns 
Fi with a background Fq, when the degrees of contrast H2.51|l are the density 
operators Pi = F^ Fi. The corresponding extremal problem (|2.52() of pattern 
recognition, which is known as the optimal identification problem, is not trivial 
for noncommutative Si, i = 1,...,to, for m > 1 even if these patterns are 
pure, that is, are described by nonorthogonal amplitudes ipi, z = 1, . . . , to. For 
the case of to = 2, however, the optimal identification problem can easily be 
solved by reducing it to the problem of optimal detection with one degree-of- 
contrast operator C = 5*1 — 5*2. Indeed, allowing for the fact that the admissible 
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operators B = L in the duality problem 12.5411 are determined by the conditions 
L > B > 0, i = 1, 2, we can proceed from (12.54(1 to H2.19|l by carrying out the 
substitution 

inf (L, E) = {S2,E) + inf {B, E) 
where B = L — S2 is the admissible operator of problem H2.19|l : 

B> {S,~ 82,82 -82}^ {C,0}. 

Thus, the solution D° to problem (|2.17|l makes it possible to represent the 
solution to problem H2.52|l in the form 

x°iS) = {C, D°) + {82, E) = {81, D°) + {82, E-D°), 

which yields the optimal decision operators D'^ ~ D° and D2 = E — D° . 

To investigate the problem of identifying wave patterns in the multialterna- 
tive case with to > 2, we restrict the space H by the minimal space 8° Q Ti 
containing all the ranges of values Tii — 8iH. Since the 8i > 0, every oper- 
ator B admissible to problem ((2.54(1 is determined by the conditions B > 8i, 
i = 1,...,TO, in view of which it is nonsingular on the subspace £° in the 
sense that BD = D = for every operator D in £°. Otherwise, operator 
D^{B — 8i)D could be negative for at least one i £ 1, . . . , m. This last fact 
means that the first condition in ((2.53() is met only ii E = D°, that is, the opti- 
mal decision operators D° determine the decomposition of unity E° = 
the orthoprojector on subspace £°, and operator B° can be found uniquely by 
summation of the remaining optimality conditions in ((2.53(1 . 

When the subspaces /Ci = HiTi. have a low dimensionality, say, ordinary 
amplitudes Hi = {ip,^\ for which /Ci = C, it has proved expedient to represent 
the solution to problem ((2.52(1 via the following. 

Theorem 3 The optimal decision operators D° determined by conditions 1(2.53(1 
for Ci — H*Hi, i = 1, . . . ,m, have the following form in space £° 

D° = {L°)-'H*fi°H,{n-\ t = l,...,m, (2.59) 

where L° = (X]i=i l^i^i) ^ ^° solution to problem ((2.54() . and the 

^i are trace class positive operators in ICi defined by the conditions 

iU - H,iL°)-'H*)fi° = 0, 1, > HdL°)-^H* (2.60) 

(1; are the identity operators in ICi). If these conditions are met, maximal 
intensity of graded signals x° = X]"=i Tr /i° is achieved. 

Proof. Multiplying the remaining equations in ((2.53(1 from the right by (i°)^/^ 
and from the left by (L°)-i/2, where L° = B°, we can rewrite the optimality 
conditions in the form 



{E° - F*F,)M° ^0, E°> F*F,, z = 1, . . . , m, 



(2.61) 
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where F, = H,{L°y/^ and 

M° = (L°)-i/2L>°(i°)i/2. 

Thus, 

M° = F*F,M° = M°F*F, = F*ii°F,, (2.62) 

where /i° = FiM°F*, which leads to 12.59|l if we carry out the inverse trans- 
formation. If we substitute (|2.62l) into H2.61|l and multiply the result from the 
right by F* and from the left by {F*)~^, we arrive at (|2.6U|) if we allow for the 
reversibility of the operators F* : /C^ — > £. The inequalities (|2.6U|) are simply 
the inequality (|2.61() in the form FiF* < 1^. The operator L° is determined 
by summation = ^° of the optimal decision operators (|2.59|) . which 

yields L° = H°iJL°Hi, and this determines uniquely the positive operator 

B° = L°. 

The proved theorem reduces the solution of the optimal identification prob- 
lem to finding the operators ^° that satisfy conditions H2.60|l . which in the case 
of finitely mixed patterns Hi constitute finite-dimensional algebraic equations 
and inequalities. For one, for pure patterns Hi = ("011: conditions H2.6()|l have 
the scalar form 

/x° = (^, I (i:°)- Vj/i- , 1 > (V'. I (i°)- i = 1, . . . , m. (2.63) 

where L° = (|-0j)/i°(V'J)^^^. The numerical positive solutions of the system of 
algebraic equations H2.63|l determine the onc-dimensional decision operators 

^° = |X.)(XJ, X. = (i°)-V.y^ (2.64) 

(which are equal to zero for those i's for which (V'i | {L°)~^ipi < 1) and the 
quality of the optimal solution, k° = 

Solution of the pattern identification problem makes it possible to establish 
the quasioptimal multialternative detection scheme using the maximum criterion 
of the total degree of contrast (|2.51|l as the first approximation in ^/e for decision 
operators of the form 

A = {Fo: + V^FuTiFo, + V^Fu) = F;^F,,. (2.65) 

Assuming that F^i — y/Jji^Hi{L°)~^ and Doi = F^^Foi, in the first order in the 
signal-to-noise ratio e ^ 1 we obtain the following formula for the degree of 
contrast of quasioptimal detection kq = ^iD^i 

m 

^0 E TrK.Ai°(l + v^(7. + 7:)/2), (2.66) 

i=l 



where 7, = H,A*(L°)-^H* , or 7, = (A,V,|(i°)"V,) when /C, C. ■ 
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2.2.4 The Signal Representation 

It has proved expedient to represent solution H2.59|l to the problem of optimal 
identification of wave patterns in the so-called signal space, /C™ = 0™i A^i, 
which is the direct sum of Hilbert spaces JCi = HiTC and which, in the case of 
ordinary amplitudes Hi = (V'J, is equal to C™. Such decomposition is carried 
out via the partially isometric operator V : Ti. /C™ of the polar expansion 
H = a^l'^V, a = HH*, for the operator H : (p e H ^ i from Ti into 

/C™, which is defined uniquely on the subspace £° by the conditions V*V — E° 
and VV* — e°, where e° is the support of the correlation matrix a = ae°. 
Note that the m-by-m matrix a = [aik] consisting of operator components 
o-jfc = HiHIl, i,k = 1,...,TO {a^k = (V'llV'fc) if Hi = (ipil), is positive and, in 
the case of the linear independence of the signals Hi , nonsingular with support 
e° — ®™ 1 li = 1™- The components Vi = l^V, i — 1, . . . , m, of the isometric 
operator V : £° /C™ determined by the diagonal projectors li from /C™ onto 
ICi bring about, obviously, the decomposition of the unit element 

m m 

E° = V*V = ^V*hV = ^2^*"^^ (2-67) 
of space £° and are orthogonal if £° = 1™: 

Representing the operators Hi in the form Hi — hiV, with hi — 1.^(7^/^, we can 
write the necessary and sufficient conditions for the optimality of the separating 
operators Di in the following form: 

(A° - (T,)Si = 0, A° > cr, := h*hi, i = l,...,m, (2.68) 

which are simply conditions for the decomposition of the m-by-m projection 
matrix e° — J^TLi ^i^ ^1 = VD°V* . TheoremOlin this case assumes the form 
of 

Theorem 4 The optimal decomposition of the support e° of the correlation 
matrix a defined by conditions H2.68|l has the form 

6° = X°^^ hfi°hX°^^ , i — 1, . . . ,m, (2.69) 

where h = cr^/^, A° = {h^°hy/^, and fi° = IJ-i is a diagonal matrix 

H° = [iJ.°Sik] consisting of the positive operators fi° : ]Ci ICi and defined by the 
conditions 

fi° = e{hX°-^h)^i°, 1" > e{hX°-^h), (2.70) 

or n° = e{y/aJjP) if a is nonsingular (e° — 1), where e : a i— > X^i^li Ij'^lj 
is the partial diagonalization operation [aik] i— > [ajfc<5ifc] of the m-by-m block- 
matrices a = [aik] consisting of operators aik '■ ^fe ~> ^i- The quality of optimal 
identification is determined by the trace in /C™ , or k° — Tx 
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Proof. Representation 12.69|l can be obtained directly via the isomorphism V 
of spaces E°n and e°/C". Here (A°)^^ = V*, an m-by-m matrix with 

elements (A*"')"^ : /Ci K-k, is the inverse of matrix A° — VL°V* with respect 
to £°: A°~^A° ^ e° = A°A°~^ Matrix A° = VL°V* consisting of operator 
elements A°j, : ICu — > /Ci is directly expressible in terms of the square root 
h = i/a of the correlation matrix 

/ m \ 1/2 

\° = \Y^Kl^°hA =VV^, (2.71) 

while the conditions (|2.6U|I for determining the operators /x°, which in the signal 
representation have form 

{UhX~^K)y° = 0, U> KX°~'h*, (2.72) 

represent the element- by- element notation for the conditions H2.7U|I imposed 
on the diagonal elements in IC"^. If a is nonsingular (which means that h is 
nonsingular, too), we can rewrite H2.70|l in the following simple form: 

^° = e{hX°-^hfi°) = e{hX°h-^) = e{^/SjJ°), (2.73) 

where we have allowed for the fact that e(a)/i° = £{aiJ,°) (because /i° is diagonal) 
and that cr/i° — {hX°h^^)^ , in accordance with (|2.7U|) can be resolved explicitly. 
Let us assume that the diagonal part e{h) of matrix h = cr^/^ is commutative 
with h. Then conditions (|2.7U|) are met at /i° = e{^/a)'^, that is, at — (hu)'^, 
i = 1, . . . , m. Indeed, the diagonal matrix fi° in this case is commutative with 

h and 

A° = = Vh^tJ-° = = he{h) = V^e{y/^). 

Moreover, hX°^^h = he{h)^^, where e{h)~^ is the diagonal that is the inverse 
of e{h), which always exists because the diagonal elements an = HiH* of the 
correlation matrix a are nonsingular and, hence, so are the diagonal elements 
hii of the matrix h = ^/a on the spaces fCi = HiH. Thus, 

e[hX°-^h) = e{he{h)-^) = e{h)e{h)-^ = 1™, 

and conditions H2.70|l are satisfied. The optimal decision operators then assume 
the form 

<5° = 1„ I?° = y*l,= V;*y„ z = l,...,m, (2.74) 

where V ~ h^^H, while the quality of optimal separation is determined by the 
total intensity: 

m m 

>c° = j2Ti-{hur = Y.Tr{a'j'r. (2.75) 

1=1 i=l 

The above- noted property of commutativity manifests itself, for one thing, in the 
case where all diagonal operators an coincide and are multiples of the identity 
element 1^ = 1 of space ICi = JC, which is the same for alH = 1, . . . , m. In view 
of the assumption that an is a trace class operator and, hence, /i° = {an)'^ , this 
is possible only for a finite-dimensional IC. In Section f2.2.5l we consider concrete 
equidiagonal families of ordinary amplitudes Hi = (V^J, for which /C = C. ■ 
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2.2.5 Separation of Cyclic Systems 

Let {•4'i}iLi be a family (or set) of nonorthogonal wave amplitudes G 'H 
that describe sound or visual patterns with a correlation matrix a = [(V'ilV'fe)] 
whose square root, h = has the same diagonal elements ha — a = hkk 
for all i,k — l,...,m. The optimal identification of wave patterns {ipi} is 
described by the one-dimensional separating operators {D'^}^^i of the form 
(|2.57l) . where Xk = C^, k — 1, . . . ,m, is generally an overcomplete system of 
polar decomposition, 

m m 

^k=Yl ^k(^ki^ = ^°khki, 
1=1 i=l 
m m 

E \Xk)ix°k\ = E ^kVk = V*V = E°, 

fe=l k=l 

in the space £° induced by the set {ipk}- Bearing in mind that fi = = 
(Tr /i/m)^, we can represent the maximal intensity x° — ma? of optimally 
separated amplitudes {Xi } in the following invariant form: 

;^°^l(Tr;^)2 = l(Trai/2)2. 
m m 

Let us consider the following example when the above-mentioned condition 
of the equidiagonality of matrix h — ^Jg is met. We will call the system {V'i} 
of amplitudes of equal intensity HV'ill = v equiangular if (V^J'^fe) = for 
every i ^ that is, if the cosines of all mutual angles are equal to 7. This 
is possible in the case when 7 > 1/(1 — m), say, when -0, — Lp^ + Lp^, where 
{^iYiLo ^n. orthogonal system of amplitudes with intensities Hi^SqII = and 
= v(l — 7) at i 7^ 0. Representing the respective correlation matrix a in 
the form 

a = - 7)1" + 7x^0;), .X = (1, . . . , 1) e C™, (2.76) 
and using the formula 

/(I™ + TX^X) = /(l):" + ^[/(l + TXX^) - f{l))x^x\ 

xx^ 

to invert it and extract a square root, we can write out the optimal system {xl} 
for 7 e](l — m)^^, 1[ explicitly: 

xl-^\ ^^k -(!-(! + f^)-/^)- E 

1^ 1-7 J 




The intensity of the signals separated by this orthogonal system is 
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and admits the maximal value x° = m.v in the event of orthogonality 7 = of 
the family {tp^} and the value >c° = fj, in. the case of colinearity of {tpf.}. 

For one, when the tp^ = \ai) are canonical equiangular amplitudes defined 
by a (d + l)-hy-(d + 1) matrix of the scalar products of vectors a; G C''"'""'^ of 
the form Q;*aJ = XS for i ^ k, \oLi\'^ = A for alH = 1, . . . , m, the quantity 7 = 
exp {A(^ — 1)} does not vanish and the maximal intensity of optimal separation 
is always lower than m/i even if the vectors {ai\ are orthogonal {8 = 0) and 
tends to mj only as A ^ 00. Note that the maximal intensity of separation of 
canonical amplitudes is reached on simplex vectors e C^^^ defined by the 
condition S = (1 — m)^^; for one thing, at m = 2 the intensity of separation of 
a pair of canonical amplitudes, 

x° = u{l - = 1 - Vl-e2M«-i), 

can be attained at ^ = — 1 by employing orthogonal vectors ai, while at 5 = 
this can be done only by doubling A = |a,;|^. 

Equiangular systems constitute a particular case of cyclic systems, which 
are defined by the condition that the correlation matrix an- remain invariant 
under translations s € Z : (i, fc) {i + s, k + s), that is, at = ~ k). 
Such translation invariant systems as containing only a finite number m of 
distinct amplitudes must satisfy also the cyclicity condition (j{l) = a{l + s) for 
I ^ i — k < Q. Since the matrix h — ^/a, as any other matrix function of 
a, also depends solely on the difference in the indices, or hi^k = h{i — k), the 
equidiagonality condition hi^k = a = h{0) is certain to be met and the solution 
to the problem of separating any cyclic system can be written explicitly. 

Let us take the case of cyclic canonical amplitudes Vfe = l^fe) defined by 
complex numbers afe e C whose real and imaginary parts can be interpreted as 
the mean frequency and duration of the wave packet \ai). There can be only 
two cases of the cyclicity of amplitudes |aj) corresponding to the equidistant 
distribution of points a, along a circle or a straight line with the center at a = 0. 

(1) Optimal estimation of phase Let a, = \/Ae^'^'j*'/'", j = \/— T. In this case 
we have a cyclic system 

aik = exp {A(e-2'^j('-'=)/™ " l} = " '^)- 

To extract the square root of matrix a one should diagonalize it by a discrete 
Fourier transformation applying the unitary matrix 

Uin = exp {2Trijn/m} /^/m, n = 0, . . . ,m — 1. 

A continuous analog of this problem, to which one can pass if m is sent to 
infinity, is the estimation of phase 9 of the vector ao = •\/Aexp{27rj6'} of the 
canonical amplitude tjjg = \a0) on the interval [0, 1]. Diagonalizing matrix 



= {aM = exp{A(e-2^j(--«) - l} 
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via a discrete-continuous Fourier transformation ji^^ = exp{27rjxn}, n G Z, we 
obtain its eigenvalues 

A„ = A"e-^/n!, n = 0, 1, . . . ; A„ = 0, n < 0. 

The optimal system of decision vectors xSj a; S [0, 1], has the form 



/(.X 

n=0 



oo ^ 

Ee2-j-"|n), where \n) = — (A*)"|0), 



with A* the creation operator in H ~ iy^(R). It can easily be verified that the 
system x° defines the decomposition of unity, 

^1 oc ^1 oo 

1= \x:)ix:\dx^ Yl l")H / e2-j-("-™)dx = E|n)H, 

but is not orthogonal. 

(2) Optimal estimation of amplitude. Let us take = iAe^^, where i € 
Z, A > 0, and j = y/—l. In this case the cyclicity condition is satisfied: 

aik = exp{-A^(i - /2} = a{i - k). 

The matrix a — [uik] is diagonalized by the discrete-continuous Fourier transfor- 
mation Ui\ = exp{27rjiA}, A € [0, 1]. For A <C 1 the problem of optimal separa- 
tion of the respective coherent amplitudes is reduced to the problem of optimal 
estimation of the real parameter cc G K of the coherent amplitude = \xe^^). 
This estimation is realized by measuring the intensity in the proper representa- 
tion of the self-adjoint operator Re Ae"-'^ in space H. = L^(]R). At = this is 
the frequency representation, while at = 7r/2 it is the temporal representation. 



2.3 Optimal Discrimination of Mixed Waves 

In this section we will take up the problem of testing wave hypotheses based on 
measuring the appropriate intensity distributions. We will derive the necessary 
and sufficient conditions for optimal testing of such hypotheses by the minimum 
criterion of parasitic contrast at a fixed level of the received signal by employing 
a method of linear programming in partially ordered Banach operator spaces. 
In a specific case these conditions formally coincide with conditions obtained 
earlier in |53j on the optimality of quantum measurements by the minimum 
criterion for the error probability. A general geometric solution will be given 
for the case of a two-dimensional space, which is sufficient for describing the 
recognition of the polarization of a plane wave. This solution is similar to the 
solution of the problem of measuring quantum mechanical spin (H] . 
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2.3.1 Wave Pattern Hypotheses 

The problem of recognizing sound and visual patterns based on measurements 
of the intensity of the received audio or optical wave can be formulated within 
the framework of the wave theory of hypothesis testing discussed below. 

Let Hi, i — 1, . . . ,m, be bounded operators from the Hilbert space Ti. to 
another Hilbert space /C describing the possible generalized random amplitudes 
at the "in" terminals of the receiver with density operators Si = H*Hi with 
a trace Tt Si < oo. The reader will recall that at /C = C the operators Hi 
correspond to ordinary amplitudes ^i £ Ti, z = 0, . . . , m, which define the 
bounded functionals Hi = (V^J : x ^ ^ (V'ilx)- Each operator Hi, i = 
0, . . . , m, can be thought of as a hypothesis, according to which at the "in" 
terminals of the receiver there is one of the possible simple or mixed patterns 
Gi, z = 1, . . . , m, in a partially coherent superposition Hi — Gi + Hq with the 
absence of wave patterns Gi . The problem of m-alternative detection of wave 
patterns Gi, i = 1, . . . ,m, may, therefore, be considered as a problem of testing 
m + 1 hypotheses Hi, i = 0, . . . ,m, and vice versa. 

The optimal testing of the hypotheses Hi, i — 0, . . . ,m is determined by the 
solution to the problem of finding a quasiselective measurement D = {-D,;}™q 
that maximizes the quality functional 

m m 
1=0 i=0 

where R = q are trace class operators with a common support E : RiE = 

Ri for all i = 0, . . . , m, operators that are usually represented by linear combi- 
nations Ri = X^I-LqC^'^A: of density operators Si = H*Hi. For instance, in the 
problem of m-alternative optimal detection by maximum of the total contrast 
H2.52|l criterion, the operators Ri are in effect the degrees of contrast i?o — 0, 
Ri — Si — So = Gi, i = 1, . . . ,m, and the admissible operators {I?i}™o 
determined by the decision operators Di, i — 1, . . . ,m and Dq = E — D, where 
D = I^o^' the problem of discriminating between the hypotheses Hi 

we can consider more general criteria defined, say, by the operators 

m 

i?o-5I^*' i?^ = (l + AOa, A. >0, i = l,...,TO, (2.77) 

i=l 

that appear in the problem of suppressing parasitic degrees of contrast {Ci, D^) 
for i ^ k: 



= 1 \ k^i I i=l 



under the condition that the useful degrees of contrast {Ci,Di), i — 1, . . . ,m. 
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are not lower than given levels e^. Indeed, if we solve the extremal problem 

{rn m ^ 

■m 

= sup I V((C,;,A)(1 + A0 

D.>0 ^ ^ V 

+ (a,S-i?>)|^A <£^}, (2.79) 

for the operators Ri = Ri (A) defined in H2.77|l . we can write the solution to the 
problem (|2.78|l in the form 

T°(C) = V (a, E) + sup i V X^e^ ~ >c° {R) \ 

^=l ^■>° [^=l J 

+A,(£,-(C„A)))|5IA <s}, (2.80) 

1=1 

provided that we employ Lagrange's method of multipliers Ai, i = 1, . . . , m. 
Let us start with the classical variant 



4,{R) = sup i V M(A,)) I E ^» = ^ 
^•>o b=0 '^=0 J 

rn 

= Em^(^°) (2-81) 



i=0 



of the problem (|2.79ll of optimal testing of hypotheses in a fixed measurement 
described by the decomposition E = J M{dx) of an orthoprojector E, RiE — 
Ri, on a Borel space X. This may be the coordinate selective measurement 
M{dx) = I{dx), X ^ Q,, ov X the momentum quasiselective measurement 
M{dx) — I{dx), X = M'^+^, or the canonical quasimeasurement M{dx) — 
\x){x\Ax, X = M'^+i, described in Section O The upper bound (|2.81|) in 
measurable partitions X = reaches the gauge 

(/^) =inf{A(X)|A>Ai„ z = 0,...,m} = Aiv(^) (2.82) 

of the family (/ij^o '^^ measures /^^(A) — {Ci, M{A)), where the infimum is 
taken over all the measures of finite variation |A|(X) < oo that majorize all /ij. 

Indeed, >c°j^j{C) < >cy{^), since for every measurable partition Q, = X]i=o — 
X, obviously, 

m m 

5]m.(A,) < ^A(A0 = A(0) < X{X). 

1=0 i=0 
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The lower bound (|2.82ll is attained at the upper bound /Zy = V™Q^j of the 
family of measures defined as /Xy > {Hi}^Q, A > /ij A > /ly, and 

is equal to the supremum (|2.50() reached on the partitions Q = X^Ho 
support n C X oi measure /i^ into regions A°, on which it coincides with the 
respective measure fi^ : 

Aiv(A°)= max ^,(A°) = /i,(A°). (2.83) 

fc— U,...,m 

In view of the last relationship, determining a hypothesis Hi for a given mea- 
surement M is reduced to searching for the number of the nonempty region A° 
on which the measured degree of contrast reaches the envelope /iy(A°) of the 
family {fi^}. However, this method does not enable us to find the wave patterns 
for which ^(A°) < n^{A°) for alH = 0, . . . , m. 

2.3.2 Optimal Multialternative Testing 

To obtain a satisfactory solution to the problem of wave pattern recognition one 
must look for the supremum H2.81|l not only over the measurement regions A^ 
but also over the various methods of such a measurement, which are described 
by the resolving operators Di = M{Ai). Thus, there emerges a nonclassical 
extremal problem H2.79|l . which may be considered as part of the conditional 
problem H2.78|l of testing the hypotheses Hi in the degrees of contrast (C^, Di), 
which are compared with given level e^, i = 1, . . . , to. The necessary and suffi- 
cient conditions for solving this problem are formulated in the following. 

Theorem 5 Theorem The upper bound H2.79|l is attained on operators D°, i = 
0, . . . , m if and only if there is a trace class operator L° > Ri, i = 0, . . . , m, 
such that 

{L° - Ri)D° = 0, i = 0, . . . , TO. (2.84) 
The operator L° is then the solution to the duality problem 

(R)^ = inf {(L,£;) |i > i?„ i = 0, .. .,to} (2.85) 

for which conditions H2.84(l are also necessary and sufficient for D° > 0, X^I^Lo ~ 
E, and >{°{R) — (R)^- The solution L° to this problem for operators Ri — 
Ri, j = 0, . . . , TO, of the form (j2.77(l represents the solution to the constraint 
extremal problem H2.78(l in the Lagrange form 

r°(C) = Y^m, E) + A°£,) - E) , (2.86) 
1=1 

where the parameters A° > are to be found from 



A° (£, - (C„ Z?°)) = 0, < (a, D°),i^l,...,m, 



(2.87) 
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Proof. The sufficiency of the optimahty conditions 12.84|l for H2.83|l and H2.85|l 

can be verified directly by employing the property of monotonicity of the trace, 

L>R, => Tr (LA) > Tr 

for Di > 0. Allowing for the equality L°E = J2^o ^i^i' which is obtained via 
summation of (|2.84() over i — 0, . . . , m, for every family (-Dj)™ q admissible in 
ifT^ we have 



i=0 i^O i^O 

m 

= TriL°E) = Y,{R.,D°). 

i=0 

In a similar manner for every operator L admissible in H2.85|l we have 

m ra 

(L, E) = Tr {LE) - ^ Tr {LD°) > ^ R,D° = (A, E) . 



i=0 i=0 



The necessity of the optimahty conditions H2.84|l follows from Lagrange's duality 
principle 



sup 

Di>0 



.1=0 i=0 > 



sup inf <^ y D,) + {l.E-Y 



= inf sup I y (R, ~ L, D,) + (L, E) \ 
= inf{(L,£;)|L>i?,:,i = 0,...,to}, 
according to which J^T^o {Ri^D,) = x°{R) = (i?) = (L°, E) and 

y Tr [(A - i?,)^- ] = Tr {L°E) - ^ . 



i=0 



The necessary and sufficient condition for this sum of traces of products of 
positive operators to vanish is, obviously, Equation (|2.84() . 

Employment of the duality principle in the conditional problem (|2.78f) re- 
duces this problem by the elementary Lagrange method to problem (|2.8U|) . for 
which the necessity and sufficiency of conditions H2.87|l can be verified directly. 
The proof of the theorem is complete. ■ 

Note that the above proof remains unchanged in the case of an infinite 
number of hypotheses, m — oo. From this theory follows, for one thing. Theorem 
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13 if we put Rq = 0, Ri = Ci, i — 1, . . . ,m, and L = B. The existence of a 
solution to problem (|2.83() and the uniqueness on the subspace £ — EH of the 
solution to problem (|2.85l) can be obtained from the proof in Section [2.2.21 of 
these assertions for problems H2.52|l and (|2.54() to which H2.83|l and (|2.85|l are 
reduced by the substitutions Ci = Ri — Rq and B — L — Rq. 

In the case of positive i?i's the problem of testing the hypotheses Hi,i = 

1 /2 

0, . . . , m, can be solved as a problem of separating m — 1 signals Hi = Ri by 
applying Theorems 01 and 01 For nonpositive Ri's it has also proved expedient 
to go over to the signal space /C'"+^ — 0™Q/Ci, JCi — HiH, i = 0, ...,m, 
via a partially isometric operator V : H ^ ^m+i polar decomposition H = 
0-1/2^,(7 = HH* for the operator H : ip £ H ^ [ifiV?]^o- a result, the 
optimality conditions for the decision operators D° can be written in the form 
of conditions imposed on the decomposition e = X^i^o '^i ' where 6° = VD°V*, 
of the support £° = VV* of the signal correlation matrix aik = HiH^ with i,k = 
0, . . . , m: 

m 

(A° - p,) S° = 0, X°>p,=Yl ^"^^1^^ i = 0,...,m. 

Here A° = VL°V*, hi = lih, h = y/a and is the quality matrix, which 
defines the operators Ri — J2T=o ^k'^i^k and which, for a fixed m, it has 
proved expedient to consider as being a diagonal operator a = ®feLo'^?lfc ™ 
space because then the signal matrices p.^ ~ VRiV* can be represented 

in the form p^ — hcih. 

Even if the amplitudes Hi are ordinary, that is. Hi — (-i/jj and hence the 
correlation matrix is a number matrix an^ ~ it is difficult to write 

conditions of optimality explicitly for m > 1 for a nonsingular matrix a. Below 
we will study this problem for the case where the rank of matrix is equal to 
2 and, hence, all the operators Ri,L, and Di can be represented by 2-by-2 
matrices in space £° = C^. 



2.3.3 2-d Wave Pattern Recognition 



To the operators {Ri} in the optimization problem (|2.84l) we assign Hermitian 
matrices that can be considered nonnegative without loss of generality. Any 
2-by-2 matrix can be decomposed in Pauli matrices, which are 



1 
1 
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R 



where x,y, z and v are real if matrix R is Hermitian, and f = xux + yo'y + za^ = 
r • cr is a vector operator represented by vector r — (x, y, z) of three-dimensional 
real space M.^^. The product of f and s, with r e M'^ and s e M"^, is equal to 
f s = r • s+j (r X s) , where r • s and r x s are the scalar and vector products of 
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r and s. Note that Tr R = 2^ and Det R ^ v'^ - |rp (with |r| = V^T), and 
that the nonnegativity condition i? > assumes the form = |r| ^ 0, and rank 
R^Q at V = Q. 

The operators Ri — {i>i+Vi)/2, i = 0, . . . ,m, where {i^il > \ri \ and X]"=o ~ 
1, can be interpreted as density operators related to the tested wave hypotheses 
with prior intensities i^i = Tr Ri and represented by vectors S R^, which 
are known as polarization vectors. A similar problem arises when we must 
identify the photon polarization or the electron spin "14). Let us assume that 
polarizations {ri} satisfy the inequalities 

\rk - r,\ > - (2.88) 

for all k i; in the opposite case, that is, — > |rfc — ri|, the fc-th hypothesis 
dominates the i-th hypothesis or vice versa: Rk > Ri or Rk — Ri or Rk < Ri, 
and one of the hypotheses (with the smaller ly) can be ignored. 

The decision operators Di in the Pauli representation Di = 5i + di are 
described by nonnegative numbers 6i > and vectors e K'^ (|di| < Si), with 
the decomposition of unity J^^o — ^ assuming the form 

m m 

^<5, = 1, ^d, = 0. 

1=0 j=0 

The solution of the problem of optimal recognition of polarizations rj with 
intensities Vi can be reduced to finding a real number A° and a vector 1° e M.^ 
that defines the operator L° — (A° + l°)/2 that satisfies conditions H2.84|l for a 
collection of D° = S° + = 0, . . . ,m. 

Theorem 6 The solution to the problem of optimal recognition of polarizations 
{ri} satisfying together with {vi} condition H2.88|l can be found if and only if 
there is a collection of numbers /i° > 0, i — 0, . . . ,m such that 



fe=0 



+ Y.{v,-vk)^il>l, i^Q,...,m, (2.89) 



fc=0 



where the equality takes place at least for those i 's for which /i° 7^ 0. The optimal 
decision operators have the form 5° ^ |d°|, d° /i°(ri — 1°), where 

m m 

l° = ^M°r./5]M°, (2.90) 

i=0 i=0 

and the maximal received intensity is 

(m \ m 

i + J2f'>n /J2^'° = ^°■ (2.91) 
j=0 / j=0 
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Proof. In terms of A°, 1°, Vi, and r^, the first optimality condition l|2.84|l . L° — 

Ri > 0, has the form 

A° > |r, - 1°| + i.. (2.92) 

The equation (L° — Ri)D° ~ can be written in another form if we nullify the 
scalar and real vector parts of the product (A° + Vi — \° — ^i){5° + d.;): 

(A° - Vi)d1 + (1° - rO<5° = 0, (A° - i,,)61 + (1° - r,) • d° = 0. (2.93) 

The imaginary vector equation j(l° — r^) x d° = 0, j = \/"-T, follows from the 
real vector equation in (|2.93|) are equivalent to 

d° = <5°(r, - 1°)/(A° - J.,), ((A° - - |r, - r\f8° = 0, (2.94) 

in the opposite case (A° — Vi for a certain i in (|2.92l) ') for i = fc we obtain 
1° — r.i, with the result that inequalities (|2.94l) and (|2.88|) become incompatible. 
The optimal decision vector can be written in the form l|2.90|l . where /i° = 
6° / {X° — Vi) is nonnegative in accordance with the inequalities 5° > 0, X° > Vi, 
and H2.92|l . while 1° is determined by the set in accordance with the fact 
that YllLo ^1 ~ 0- The second equation in H2.94|l implies that inequalities (|2.92|l 
become amplitudes for the values of i for which 5° = ^^{X^Vi) ^ 0. Multiplying 
(|2.92|) by s-'^d finding A in the form 

Cm \ m 

from the condition that X]i=o ^ ^ '^o^' 

<5°-/i°(A~i..)=A^°|r.-11 = |d°| 

we get condition (|2.89l) for determining Since Tr L° = A°, the maximal 

intensity of H2.85|l is equal to H2.91|l . The proof of the theorem is complete. ■ 
Note that the equalities in H2.89|l must be true for at least two indices i and 
fc, since there is no such set {Hi} , /Xj ^ 0, for only one subscript i that satisfies 
the i-th inequality. For every pair and for which (|2.88() is valid there is 
a unique solution of the «-th and fc-th equalities in H2.89|l with /i^ = for all 
j 7^ i, k with ^^ > 0, ^j, > 0: 

= (|rfe - Vi\ + Vk - Viy^, Mfc = (ki - rfcl + Vi - VkT^, 

but such a set {/ij} may not satisfy the other inequalities in H2.89|l for j ^ i, k. 
If there exists a pair , rfe for which all the inequalities in H2.89|) are valid at 
^ only when j = i, k, then the optimal decision vectors d° are zero for 
j j^i,k (see (ESOJ) and 

d° = (r, - rfe)/2|r, - r,.], d?. - (r^ - r,)/2\rk - r,|. 

Here the optimal decision operators D° = \d°\ + d° are orthogonal and corre- 
spond to an error intensity 
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In the case where the optimal operators D° are nonzero for more than two 
z's, they define a nonorthogonal decomposition of unity in the two-dimensional 
space £ = C^. We will not try to find a general analytical solution to the 
system of equation H2.89|l with /i° 7^ for more than two z's; rather, we will give 
a geometric interpretation of such a solution. 

2.3.4 Geometric Representation of 2-d Patterns 

Let us represent the Hermitian operators (|2.86|) by points r — {i/, x, y, z) = [v, r) 
in the four-dimensional Minkowski space R^^"^. To every nonnegative operator 
there corresponds a point inside the light cone v — |r|. In these terms a priori 
neither the fc-th nor the i-th hypothesis is dominant at Ri ~ {vi + f j)/2 and 
Rk = {vk + ffc)/2 if and only if th e interval - ru = {vi - Vk,Yi - r^) is 
spacelike. In accordance with (|2.92(l . point 1° = (A°,l°), which represents the 
operator L° ~ (A° + 1°), is the apex of the four-dimensional cone 

C{1) = {r = (z/, r) : - A° + |r - r| = 0} (2.95) 

covering all the points r.^ = (i/;, r;) and containing the subset {vj^ } C {r^} of the 
boundary points rj^ satisfying H2.92|l . On the other hand, the optimal points 
/ are only those whose projections 1 belong to the convex hull of the boundary 
subset of the spatial projections r^^ , a = 0, . . . , s, s <ni: 

s s 

where, in accordance with (|2.9U|) . tTj^ — ^J-j^/ J2a=o f^j^ > (^^ = if is 
covered by cone (|2.95|l : i^i + |r.i — 1| < A). We will say that the subset {rj^} 
has an apex if the points rj^ lie on the cone: rj^ C C{1) with an apex I whose 
spatial projection 1 is a point on the convex hull {fj^}. In these terms Theorem 
El can be formulated as follows: 

Theorem 7 To solve the problem of optimal recognition of points Vi = (i^i, r^), i = 
0,...,m, separated by spacelike intervals (I2.88() . it is necessary and sufficient 
to find a subset {vj^ } with an apex 1° belonging to a cone that covers all other 
points of set {ri}, that is, to specify a subset of vectors Vj^ C {r^} , a — 0, . . . , s 
whose convex hull contains vector 1° with respect to which the sum \rj^ — + 
is the constant X°: 

|rj„ - 1°| + t/j„ = A°, a = 0, . . . , s, (2.97) 

while \ri ^ l°\ + Vi < A° for all other indices i G {ja}- The optimal decision 
operators are represented by points on the cone 

d°^{6°,d°), 5°^\d°\, 

with spatial vectors 

m 

d° = <(r,-n/^7r,r„ (2.98) 

i=0 
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where 7r° = for i ^ {ja}, cmd {ti""^ , a = 0, . . . , s} is any nonnegative solution 
to the system of equations (|2.96|l . The minimal intensity in this case is 

m 

x° ^Y.^v, + \t,-\°\)tt,. (2.99) 

1=0 

Note that every pair of point r^, separated by a spacelike interval defines, 
via two equations from H2.97|l . = i,k, a set of point 1 € R^+'^ whose difference 
of distances to the points and is constant: 

\r - Vkl - \l° ~ r,\ ^ ly, - i^k. (2.100) 

These points lie on one of the two sheets of the hyperboloid of revolution with 
foci at Ti and and eccentricity 

Here, if vi — Vk, the hyperboloid H2.100() becomes a plane normal to the segment 
riTTi+rfeTTfe (tt > 0, TTi+TTfc — 1) at point (ri+rfc)2, while if ^ Vk, we select the 
sheet in whose plane lies the focus with the higher intensity, Vi or z/^ . Obviously, 
if the subset {rj^} has an apex I, the spatial projection 1 is the common point 
of all the hyperboloids H2.100|) corresponding to all the pairs of the set {r^^ } 
that belong to the convex hull {fj^}. We will call this point 1 the center of 
{rj^} representing L° is unique, which means that the center of {rj„} is unique, 
too. It can easily be shown that for every vector 1 of the convex hull {vj^} 
the system of linear equations H2.96|l has a unique solution {tt"^ } if and only if 
vectors Vj^ ~ Tj,,, a = 1, . . . , s, are linearly independent. 

2.3.5 Optimal and Simplex Solutions 

The reader will recall that a convex hull of a set {r^^ } of points rj^ , a = 0, 1, 2, 3, 
is called an s-simplex (a segment if s = 1, a triangle if s = 2, a tetrahedron if 
5 = 3, and so on) if the vectors r^p, Vj^ , a = 1, . . . , s, are linearly independent. 
It is well-known that each s-dimensional face (an s-face) of an n-simplex (n > s) 
is a simplex, too. We will call a subset that generates a simplex convex hull a 
simplex subset. 

Theorem 8 The problem of optimal recognition of polarizations {r^, i — 0, . . . , m} 
always has a solution that can be described by the simplex set |d° ,Q! = 0, ...,s},s< 
m, of the nonzero vectors H2.98(l corresponding to the simplex subset {vj^ } C 
{r^} with a center at 1° and a maximal sum 

'^jc + kj„ - 1°| = max {v, + |r,; - 1°|} . 

■t=0,...,m 

This solution is unique if and only if the s-simplex generated by subset {rj^} is 
an s-face of the convex hull of all vectors rj^, . . . , Vj^^ with a common center 1°. 
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Proof. By Theorem [7| the solution to the problem considered here is reduced 
to finding the cone H2.95|l that covers all points {r^} and has an apex 1° with 
a projection lying inside the convex hull of projections {r^^ } of the tangency 
points rj^ . Obviously, there is always such a cone. Let n < m he the number 
of tangency points rj^,a = 0, . . . , n. If the subset {r^^ , a — 0, . . . , s} {s < m) 
is a simplex set, the validity of Theorem 12.801 is obvious. If this subset is not 
a simplex, then the convex hull {yj^ } can be partitioned into several simplexes 
with a common vertex rj^ via diagonal planes {vj^ , rj^ , r^^ ) or diagonal lines 
{vjg , Tj^ ) when all the vectors r^^ are coplanar. Hence, the center 1° is an interior 
point of one of the s-simplexes (s < n < m) with apexes Vj^ , a = 0, . . . , s, which 
are the projections of the tangency points and determine the unique positive 
solution {tt"^} of system (|2.96|) . The set {d°^} of nonzero vectors (|2.98l) is a 
simplex set if and only if the set {vj^ } is a simplex and determines the optimal 
solution with maximal quality H2.97|l and minimal error intensity H2.99|l . When 
center 1° is an interior point of a nonsimplex convex hull of the projections of 
tangency points, the optimal simplex solution is not unique (the partition into 
simplexes is note unique) and there are also optimal nonsimplex solutions. But 
if point 1° is a boundary point of the convex hull, that is, an interior point of 
an s-face, the optimal solution is unique if the face is an s-simplex. ■ 

Corollary 9 To solve the problem of optimal testing of several hypotheses in 
the two-dimensional space £ — C'^ , it is sufficient to limit oneself to s + 1 < A 
solutions Jo, ... , js corresponding to a simplex subset of hypotheses Vj^,..., Vj^ . 
Each such solution procedure can be realized in an indirect measurement de- 
scribed by an orthogonal decomposition in the observation space 7i = (E) . 

Indeed, in the three-dimensional space M.^ there is not a single simplex subset 
Tjg,..., Tj^ for s > 3 and, therefore, for every m there always exists an optimal 
decomposition in the two-dimensional space £ consisting of s -I- 1 < 4 nonzero 
decision operators = + of rank one. It is well known that each 
nonorthogonal decomposition of unity in operators Dj^ , . . . , Dj^ of rank one can 
be extended to an orthogonal decomposition in an (s-l-l)-dimensional space £ C 
Ti. Hence, we can limit ourselves to the four-dimensional measurement space 
7i, which can always be represented as the tensor product of two-dimensional 
spaces £ : Ti. = £®£^ corresponding to the composition of two identical systems. 

Note that the optimal solution may be degenerate (in the sense that a hy- 
pothesis r.i may correspond to Di ~ 0) even if the set ro, . . . ,r„i is a simplex 
set (m < 3), for example, a.t m — 2, = vi = V2, if the vectors ro,ri and r2 
form an obtuse triangle. 

In conclusion of this section we will consider two particular cases. 

(1) Optimal recognition of pure polarization. Here the polarizations are nor- 
malized to prior intensities: |ri| = i>i, with the representative points = (i^;, r^) 
belonging to the cone i' = |r|. Expression H2.97|l . which determines the subset 
of points rj^ of tangency of this cone and the covering cone H2.95|l , has the form 



|r,.-11 + |r,J=A°. 



(2.101) 
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In relation to r^^ , this is the equation of an eUipsoid of revolution with foci 
at and 1° and eccentricity e — |1°|/A° < 1. In accordance with (|2.92l) . all 
the other point <^ {rj„} must lie inside the ellipsoid. Hence, the problem 
of optimal recognition of pure polarization is reduced to finding the ellipsoid 
described about points {r^} with foci at and 1°, where 1° is an interior point of 
the convex hull of the points of tangency {rj„}- The quality >c° of the optimal 
solution is equal to the length of the major axis of the ellipsoid, A°. 

(2) Optimal recognition of equi-intensity polarizations. The priori intensities 
Vi — uq, i — I, . . . ,m, and the corresponding points are points of the hyperplane 
= i/Q. The density operators Ri — (i/ + ri)/2, all having the same trace Vq, are 
represented by normalized vectors, |ri| < i^q. The intersection of the covering 
cone (|2.95f) and the hyperplane v — is a. sphere |r — 1°| = p of radius p = 
X° — vq. Hence, the problem of optimal recognition of equiprobable polarizations 
is reduced to finding a sphere described about all points : |r — 1°| < /? with 
radius p and centered at 1°, the center belonging to the convex hull of the 
tangency points r^^ : |rj^ — 1°| = P- The radius p = X° — i>o determines the 
maximal intensity H2.91|l : 

}<°^p + vo = >H (2.102) 

{p < va since |ri| < vq for all i's). The minimum of intensity H2. 102(1 is obtained 
at p = i/Q : >f° = 2vo. This corresponds to the typical equiprobable case |ri| = 1, 
when there is at least one simplex subset {rj^} for which the center 1° = is 
an interior point of the simplex. 



Chapter 3 



Efficient Estimation of 
Wave Patterns 



In this section we develop the noncommutative theory of efficient measurements 
and optimal estimation of unknown parameters of wave patterns as applied to 
problems of sound and visual pattern recognition. We consider two variants of 
the lower bounds for the variance of the measured parameters, the variants be- 
ing based on noncommutative generalizations ^Sl; [Hll of the Rao-Cramer 
inequality 0^1 j ^^^d introduce the notion of canonical states ^7], for which we 
derive generalized uncertainty relations similar to the quantum mechanical un- 
certainty relations PU], JS]. We then establish the necessary and sufficient 
conditions for efficient measurements, conditions that extend the conditions of 
efficiency of quantum mechanical measurements obtained in jl7) . |18| to the case 
of classical wave signals and fields. We formulate the necessary and sufficient 
conditions for the optimality of generalized measurements of wave patterns, con- 
ditions that generalize the respective conditions for quantum systems, see for 
example |37| . Finally, we investigate the structure of optimal covariant measure- 
ments for symmetric wave patterns, which in the case of quantum symmetric 
fields has been studied in [201 , C31 • The exposition is based largely on the works 
of Belavkin [?], [EI, [SI, CHI- 



3.1 Wave Patterns Variances and Invariant Bounds 

We will consider two variants of the lower bound for the variance of parameters 
of wave patterns, both based on noncommutative generalizations |SSIj QHji 
of the Rao-Cramer inequality. In contrast to 35 , '52', these bounds are rep- 
resented in a form invariant under diffeomorphisms, the form will be used to 
obtain generalized uncertainty relations and efficient measurements of canonical 
parameters. 
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3.1.1 Classical Variance Bound 

In Section 12.21 we considered the problem of recognizing pure or mixed wave 
patterns taken from a given finite or denumerable standard family. Generally, 
sound and visual patterns may contain unknown parameters that run through 
an infinite set of continuous values S O of finite or denumerable dimensionality. 
For example, we may not know the mean frequency and the moment when the 
sound signal appears or the mean position and the wave number of the visual 
pattern, or we may a priori have no information on the expected amplitudes of 
the oscillations in the given finite or denumerable family of standard modes. 

It is natural to estimate the unknown parameters by the intensity distribu- 
tions in the representations in which the wave packets with distinct values of 9 
are clearly separated; for example, the frequency and position can be calculated 
as the mean values on the coordinate representation, while the mean time of 
arrival of a signal and the wave number of a wave packet can be calculated in 
the momentum representation (but not vice versa). 

We will call a family of wave packets described in a Hilbert space TC by 
amplitudes {ipg} resolvable in a representation defined by the decomposition of 
unity I = J M{dx) on a given Borel space X if there exists a measurable map 
: X ^ Q satisfying the condition 

j e{x)fig{dx) = e J fig{dx) we e e, (3.1) 

where ^g{dx) = {ipg \ M(dx)'ijjg) is the respective intensity distribution on X. 
Thus, the resolvability of the family {4>g} means that it is possible to calculate 
the unknown 6 € O in a given representation given the observed distribution of 
the Hg as the mean values of a function 6{x) known as the unbiased estimator 
of parameters 0. 

It is natural to define the quality of the resolvability of family {ipg} by the 
size of the variance of the unbiased estimator 9, assuming that the quality for 
a given 6 is all the higher the smaller the standard deviation from 9 in the 
distribution induced on O by the measure fig of the wave packet ipg. To find the 
lower bound for this variance, we can use the classical Rao-Cramer inequality 
PU] if the measure fig possesses the appropriate differentiability properties in 0. 
For the sake of simplicity we take the case of one parameter 6* € M. If we assume 
that there exists a second moment for the logarithmic derivative 7^ = din fig/ dO, 
which is the Radon-Nikodym derivative of measure fi'g = dfig jdB defined by 

\^B{dx)^g(x) = dfig{dx)/d6, (3.2) 

we can easily obtain the inequality 

j {e{x) ~ efflg{dx) ■ j {^g{x) - lgffig{dx) > (3.3) 

where Jg = J fig(dx) is the total intensity of the wave packet ipe^ ^^'^ le the 
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mean value of jg : 

Jele = / ig{x)iiQ{dx) ^ Jg = 2Re(V'e I V'e)- 



Inequality (|3.3|l , which implies the inverse proportionality of the standard devi- 
ation (Tg > or variances 

<^l^ I (H^) - effie{dx)/Je, a? = y ijeix) - 9ff,g{dx)/Je (3.4) 

follows in an obvious manner from the Schwarz inequality if we allow for the 
fact that the right-hand side can be represented, in accordance with H3.1|l . in 
the form of the square of the scalar product 

Je^ J e{x)fi'e{dx) ^0J'g^ J {9{x) - e){%{x) - jg)tig{dx). 

In a more general situation, where the estimated parameters = [0*]™ i are 
differentiable functions 9{a) of unknown parameters a — [ct'']^-i of the den- 
sity operators of mixed wave patterns S{a), we can easily obtain a matrix 
Rao-Cramer inequality that is invariant with respect to the choice of the state 
parameters: 

R > Da^^D\ (3.5) 

where D = [09' /da''] is the Jacobian matrix of the a 9 transformation, 
R = <ygg is the covariance matrix 

fffc(a) = / {9\x) - 9'{a)){9^{x) - e''(a))n{a, dx) (3.6) 



of unbiased estimators 6 {x) with respect to /x(a, da;) = Tr S{a)M{dx), 

\x)fi{a, dx) = 9'{a)J{a), J{a) = Tr5(a), (3.7) 

and cr^i^ is a similar covariance matrix for the logarithmic derivatives 7^, — 
d\nfi{a)/da'^, k = 1, . . . ,n. We will derive the inequality for the general non- 
commutative case. 



3.1.2 Symmetric Nonclassical Bound 

The lower bound of inequality H3.5(l depends, naturally, on the choice of the 
representation determined by the method of measurement, by using the non- 
commutative analog of the logarithmic derivative introduced by Hclstrom, we 
can obtain a more exact bound for the variances of the unbiased estimators that 
does not depend on the choice of representation. 
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If we assume that the family {Sg} of the trace class density operators is 
strongly differentiable in in a certain region 0, we can define a symmetric 
logarithmic derivative by the following equation: 

geSe + Sege = 25^. (3.8) 

It is easy to show (see |3T]) that if |Tr {S'gx)^ < cTr (Sex^) for every Hermitian 
operator x, the solution to this equation exists and is unique, with Tr {Segg) < 

CO. 

Let us consider the operator 



e{x)M{dx), Tr {Sex) = OJe, 
determined by the unbiased estimator 9 for a fixed measurement M . Since 
(0(x) - e)^fie{dx) = Tr (^Sg J {e{x) - efM(dx)^ , 
Tt [Se I {0{x) - x)M{dx)(e{x) - x) + {x - ef] > Tr [Seix ~ d)\ 



it is sufficient to find the lower bound of the variance a\ of operator x. By 
analogy with the commutative case we have 

Je = Tx{xS'e-B3'e)^Tx\{x-Q)S'e\ = \lx\{x-B){gQSe^Seg6)\ 
= iTr [Se{{x ~ 0){ge - jg) + [ge - 7g){x - 9))] 

= (x - 6*139 - 7e>+ • 

Thus, the total intensity Jg is equal to the symmetrized scalar product with 
respect to Sg of the operators x ~ 6 and gg — ^g, where jg = Jg/ Jg is the mean 
value of the operator gg of the logarithmic derivative, and Jg = Tr (Sggg) = 
Tr Sg. Applying the Schwarz inequality 

I 1 2 

\{x-e\gg- Jg)^\ < {x-e\x-0)^{gg-jg\gg- Jg)^ , 

we arrive at the sought inequality: 

crl > Tr [Sg{x ~ ef]/Jg = al> Jg/Tv [Sg{gg - ^gf] = I/cT?,. (3.9) 

Thus, the variance of any unbiased estimator cannot be smaller than the inverse 
variance of the operator of the logarithmic derivative 1)3. 8f) : 

cjlg^Tr[Sg{gg--igf]/Jg. (3.10) 

A similar result can be obtained in the case where there are several param- 
eters 9 = [0*]™ 1 for the estimator 9{x) = \9 (a;)]™ satisfying the unbiasness 
conditions H3.1|) . which when met make matrix (|3.5|l the covariance matrix of 
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estimators 9 , and the mean square error at a fixed Rg assumes the minimal 
value. 

For the covariance matrix Rg , Helstrom has established the lower bound 
by assuming that the operator function Sg = S{6) is differentiable and using 
the concept of the operators gi of partial symmetrized logarithmic derivatives of 
the functions S{d) in 9\ He defined these operators by the following equations: 

g,Sg + Sgg,^2{dSg/d9'). (3.11) 

As in the classical case (JH): this bound is defined by the matrix Gg = [Gik{9)] 
of the covariances of the solutions gi = gi(0) of (|3.1f) . This matrix for noncom- 
mutative gi is taken in symmetrized form 

Gtk{0) = ]^TvSg{gigk + gkgi) (3-12) 

(the mathematical expectations of Tr (Sggi{9)) are equal to zero). The corre- 
sponding inequality has the form 

Re>Gg\ 9ee (3.13) 

and is understood as the nonnegative definiteness of matrix [R'^'^{9) — G^''{9)], 
where G'^^{9) are the elements of the inverse matrix G"^ : G'^ {9) Gjk{9) = SI. 
Inequality (|3.13|l is the noncommutative analog of the Rao-Cramer inequality 
|4t)| . The matrix Gg plays the role of a metric tensor locally defining the distance 
(t{9,0 + d9) = Gik{9) d9^ d9^ in the parameters space Q, similar to the Fisher 
information distance in classical statistics. 

We now turn to a more general situation where the state parameters are 
not the measured parameters 9^ but other parameters a = {a^ , k = 1, . . . , n}, 
S — S{a). The parameters 9^ are differentiable functions 9^ = 9^ (a) of the 
unknown parameters. The respective generalized Helstrom inequality H3.13|l 

represents a bound for the covariance matrix R = R{a) of the estimators 9 in 
a form invariant with respect to the choice of the variables of states S{a), 

R>DG-^D'', (3.14) 

where D [89' /da''] , and G = G(a) is the covariance matrix (|3.12l) of the 
operators gk — gk{oi) of symmetrized logarithmic derivatives of the operator 
function S{a) in q*^. 

Inequality (|3.14|) . which is equivalent to inequality (|3.13|) only if m = n and 
matrix D = D{a) is nonsingular, can be verified by a line of reasoning similar 
to the one that will lead us to inequality (|3.17() (see Section l3.2.4() . 

Inequality can be reduced to the classical Rao-Cramer inequality only 

where the family {S{a)} is commutative. For noncommutative families other 
generalizations of the Rao-Cramer inequality are possible, generalizations 
that are based on other definitions of logarithmic derivatives and that lead to 
other lower bounds for R differing from the invariant Helstrom bound DG^^D'^ . 
For real-valued parameters a these generalizations may serve equally well as 
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analogs of the Rao-Cramer inequality and coincide only if {S{a)} constitutes 
a commutative family, in which case they are reduced to the classical Rao- 
Cramer inequality. However, in the event of complex-valued parameters a a 
special invariant generalization of the Rao-Cramer inequality becomes especially 
important. It is based on the notions of right and left logarithmic derivatives 
and was suggested independently by Belavkin and Yuen and Lax [5^ . 

From now on we shall consider complex parameters a*^ = a J -|-ja2i = 
{a''} € C". The estimated parameters 9^ = 9\a, a), i — 1, . . . ,m, are assumed 
to be functions independently differentiable in a and a.^ Let us define the 
non-Hcrmitian logarithmic derivatives of S* = S{a, a) thus: 

Shk = dS/da'', his = dS/da^, k=l,...,n. (3.15) 

The operators hk — hk {a, a) are called the right derivatives with respect to 
a'^ (and the operators h'^, the left derivatives with respect to a) and have zero 
mathematical expectations. The covariance matrix H = [Hik] of 

H^kia,a) = Tt [S{a,a)hihl] (3.16) 

is Hermitian and, assuming it is nonsingular, defines a positive definite metric 
ds^ + Hik daMa'" in a complex domain O C C" of the unknowns a G O. 

Suppose that a joint measurement of the parameters 9^ is described by a de- 
composition of unity that defines the estimator 9. This estimator is represented 
by a vector quantity that, in general, assumes complex values x — {x^} G C™, 
is represented by a conditional distribution fi(dx | a, a) = Tr [AI(dx)S{a,a)], 

and satisfies the unbiasness condition (j) ^ = 9^ {a, a). Then the mean square 

error of measurement is determined by the matrix R = R{a, a) of covariances 

R''' ^ (^{^ - 9'){9'' - 6''')*^, for which the following inequality holds true: 

R>DH-^D\ (3.17) 

where D = D{a, a), a.s in 13.14|l . is the matrix of the derivatives 89^ /da^, and 
Z)^ is the respective Hermitian conjugate matrix. Even in the real case, that is 

9=9, inequality (|3.17|) leads to a lower bound that differs from the Helstrom 
bound (|3.14() . We will call the lower bound in (|3.17() the right bound. Other 
bounds can also be considered, say, the left bound, which is based on the left 
logarithmic derivatives with respect to a. The proof of all such inequalities is 
similar to that of inequality H3.17|l . which is given in Section [3.2.41 The right 

^The derivatives d/da and d/da are defined in terms of the partial derivatives d/dai and 
d/da2 in the common manner: 

d/da = —{d/dai + jd/da2), 

d/da = -{d/dai + jd/da2). 



3.2. UNCERTAINTY RELATIONS AND EFFICIENT MEASUREMENTS63 



bound in H3.17|l is invariant under replacement of derivatives with respect to 
a*^, by derivatives with respect to new variables p'' = (i^{a) only if the func- 
tions (a) are analytic, that is, dfi^ jdot — 0, and the matrix of derivatives 
is nonsingular. Hence, the use of inequality H3.17|l in invariant form 
R > H~^, where, as in we employ derivatives with respect to the esti- 

mated parameters 9^ (but, in contrast to right derivatives rather than 

symmetrized are used), is inexpedient since the condition for the equivalence of 
these inequalities includes not only the condition that matrix D be nonsingular 
but the analyticity condition dO^ /da^ = as well (that is, the independence of 
functions 0*(a, a) on a), which is not out initial assumption. A similar situation 
for complex-valued parameters a exists in the classical case. 

3.2 Uncertainty Relations and Efficient Measure- 
ments 

In this section we will introduce the notion of canonical families of wave pat- 
terns for whose canonical parameters we will establish uncertainty relations 
that generalize the quantum mechanical uncertainty relations obtained in the 
one-dimensional case by Helstrom |31) and in the case of multidimensional Lie 
algebra by Belavkin JHI ■ We will then find the limit of accuracy in estimating 
the canonical Lie parameters of wave patterns and prove that such limits are 
exact only for canonical signals for which there are efficient measurement or 
quasimeasurement procedures. The discourse will follow the scheme suggested 
in |18|: for examples of uncertainty relations for quantum systems the readers 
is advised to turn to PH | .[ST ] . 

3.2.1 Canonical Families and Uncertainty Relations 

In classical mathematical statistics an important role is played by canonical, or 
exponential, families of probability distributions, for which a special selection 
of parameters 9 and a makes the Rao- Cramer bound exact. In Section [3 . 2 . 31 we 
will prove that in the noncommutative case a similar role is played by density 
operators of the form 

^(/3j)=X"V*^5oe^"s (3.18) 

where the Xk,k = 1, . . . , n, are linearly independent operators in 7i, which may 
be non-Hermitian (i^ Xk) and may not commute with the conjugate operators 
{xiX*j, ^ x*j,Xi), and x — xiPiP) is the generating function of the moments of 
these operators in state Sq: 

x(/3,^) =Tr5oe'^'*'-e'3'*S (3.19) 

which is finite (x < oo) in a neighborhood of zero /3 = of the complex space 
C". The family of density operators H3.18() will be said to be canonical and the 
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parameters /3'^, canonically conjugate to the Xk- In contrast to the commuta- 
tive case, even for Hermitian operators Xk it is meaningful to assume that the 
conjugate parameters may have complex values. 

Of special interest is the case, which has no classical analog, of canonical 
states HH.18|I where the p'' are imaginary and the Xk are Hermitian. The pa- 
rameters — Im/3'°/(27r) acquire a dimensionality and meaning of quantities 
that are dynamically conjugate to the Xk] for instance, if x is frequency, 6 is 
time, if x is momentum, 6 is displacement, if x is angular momentum, 6 is the 
angle of rotation. The canonical states (|3.18(l at p'^ = 27rj0'^ assume the form 

and are unitary equivalent to state So, which corresponds to a zero value of 9. 
It has been established that if we put a = P and apply inequality H3.17|) to the 
canonical family (|3.20() . we arrive at the exact formulation of the generalized 

uncertainty principle for any pair of dynamically conjugate quantities 9 and 
x^, where the first quantity in the pair may not correspond to the Hermitian 
operator that meaningfully describes in H the measurement of this quantity.^ 

Differentiation H3.18|l with respect to P and comparing the result with H3.1(l , 
we get 

k = e-'^"'=x jT(x-'e^'*'«) = x^CP) - 9,, (3.21) 

where x^{P) = e-/3"*fc _|,e^'^% and 9i = Inx = Tr [Sx^{P)]. Matrix (TTTCIl . 
therefore, is the covariance matrix. 

o2 1 

= Tr [S{x,CP) - 0,){xu{P) - 9^)*] = (3.22) 

dp dp 

of the operators Xi{p) analytic in /? and coinciding with Xi dX P = 0. The 
inequality l|3.17|l in the neighborhood of point /3 = 0, therefore, can be written 
in the form of the uncertainty relation 

R>DS-^D\ (3.23) 

which establishes the inverse proportionality between he matrix S — [Stk] of the 
covariances 

= Tr [SiP,P)ix, - /i,)(ifc - /iJI (3.24) 
of the operators Xi, Tr [S{P,P)xi] = /ij, and the covariance matrix R of the 
estimators 9 of the functions 9^{P,P) of the conjugate parameters P and p. 

■^The Heisenberg uncertainty principle is usually proved only for such dynamically con- 
jugate quantities described by noncommutative operatiors p and q that satisfy, say, the 
commutation relations [p,q] = l/27rj. The proof employs the well-known scalar inequality 

(p))^) (^{4 ~ (9))^^ ^ l([P> 9])!^ /4, which is valid for any pair of operators p and q. 
Strenthening and matrix multidimensional generalization of this inequality in terms of the 
covariance estimations of an arbitrary family of noncommutative operators are suggested in 
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At 9 = P, 13.23|l transforms into strict inequality in the entire domain 9/3. 
Putting 9 = Im/3/(27r) and allowing for the fact that d9/df3 ~ l/47rj, we obtain 
x'' — X the generalized uncertainty relation 

{2TrfRe > (1/4)5-^ (3.25) 

in terms of the variances Rg — (^{9 — 9)"^^ , with S* = Tr [5*0(2; — /^)^], valid 

for any pair of dynamically conjugate quantities 9 and x defining the canonical 
family (|3.20() . For the quantum case of pure states 5o = |V'o)(V'ol' the scalar 
inequality (j3.13|l by Helstrom [21] by a complicated procedure for calculating 
the matrix elements of the operators of symmetrized logarithmic derivatives. 

In the multidimensional case, when the operators Xk are pairwise commuta- 
tive (but not necessarily with x^. and 5o), the situation is the same: XkiP) = Xk 
for every (3 G O and inequality (|3.23(l is strict. The averages ^j, and the covari- 
ances ()3.24(l at Xk = and (3'^ — 2'it]9^ are independent of 9 and, therefore, 
coincide with the respective values at 9 = 9: /ij, = Tr (SoXk) and 

S.tk = Tr [So{xi - fJ.i)(xk - Aife)]. (3.26) 

The uncertainty relation (|3.25|l in this case acquires a matrix meaning: Rg is 
the covariance matrix of estimators 9 of the canonical parameters belonging 



to a translation group in state Se, and 5 is the covariance matrix (|3.26|l of the 
generators Xk of tlic group, defining the lower bound S^^ /Wn'^ for Rg uniformly 
in every 6* G M". 

We now take up the case of noncommutative {xk}. Suppose that the oper- 
ators Xk form a Lie algebra: 

XiXk X}^Xi — ^ik^j^ (3.27) 

where C^^. are structure constants. Here the operators Xi{(3) in (|3.21(l are linear 
combinations of the generators Xi : 

xS)^L-\-P)lxj, (3.28) 
with L{^) — ^^Ck{I — 6-^*-^)"^ an rt-by-n matrix that exists in a neighborhood 



O e C" of zero ^ = 0, and the Ck 



^ik 



are the generators of the adjoint 



representation of the commutation relations H3.27(l . Expressing the covariance 
matrix H of the operators H3.28|l in terms of the covariances H3.24(l of the gen- 
erators Xi, we get instead of (|3.23|) the inequality 

R>DL^S-^LD\ (3.29) 

where L = L{~(i). In the case of (|3.20() . the family of Sg is unitary homoge- 
neous with respect to the Lie group with Hermitian operators Xk and canonical 
parameters 9^ . Similarly to H3.25|l . we obtain a more general relationship 

(2T:fRg > ^LlS-^Lg, (3.30) 
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where Le = O^Gi{I - e-j^'°'^'=)-\ Gk = 27rjCfc. Inequahty ipOT))! determines in 
the domain O C M" of the convergence of the series expansion 

the lower bound of the mean square error in estimating the canonical parameters 
of the unitary representation e^^^^ of a Lie group. 

3.2.2 Efficient Measurements and Quasimeasurements 

In classical statistics, estimations whose covariance matrix assumes the minimal 
value and thus transforms, locally or globally, the Rao-Cramer inequality into 
an equality are known as efhcient (locally or globally, respectively). In the 
noncommutative case, the concept of efficiency introduced by analogy with the 
classical concept looses its universality because the generalization of the Rao- 
Cramer inequality is not unique and the definitions of locally efficient estimates 
|SS|, [Sni; based on different variants of this generalization are not equivalent. 
For this reason we distinguish between the efficient measurements (or estimates) 
for which the invariant Helstrom bound H3.14|l is attained and those for which 
the right bound (|3.17() is attained, with the former called Helstrom efficient and 
the latter, right efficient. As we show below, the notion of right efficiency is more 
universal: measurements that are Helstrom efficient are right efficient, but not 
vice versa. Let us first prove that Helstrom efficient estimates exist globally for 
canonical families of density operators H3.18|l if the operators Xk are Hermitian 
and pairwise commutative and if for the estimated parameters 9 we take the 
derivatives 6k — 91nx/9>f'^ of the generating function x{^) ~ Tr(S'oe^ ^'=), 
where x = (3 + fi. The parameters 9k selected in this manner coincide with the 
averages defined by the canonical subfamilies of the density operators, 

5(>^) =X"'(>^)e"'*'=/'5oe"'°"'=/', (3.31) 

with Im^'' = 0: 

9k{>i) = Tr [S{>i)xk\ =d\nx/d>c''. (3.32) 

Taking for parameters the canonical parameters and differentiating the 
operator functions l|3.31(l , we obtain the symmetrized logarithmic derivatives in 
: cjk = Xk — 9k- Thus, the covariances (|3.12|l coincide with the covariances of 
operators x^, 

G,k = Tr [S{x){x, - 9,){xk - 9k)] - (3.33) 

which are equal to the derivatives Dik — d9i jdx^ defining matrix D in 13.14|l for 
OL = X- Therefore, inequality (|3.14|) assumes the form R> G, or [Rik — Gik] > 
0, where Rik ~ {(Pi^ 9i)(9k — 9k)^ are the covariance of the unbiased es- 
timators 9k ■ (9k) ~ 9k- If for these estimators we take the results Xk of 
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measurements of the observables Xk (which are compatible) , then matrix R as- 
sumes the minimal value R — G. Thus, for the canonical families (|3.31|l with 
commutative operators Xk there exists a Helstrom efficient measurement of the 
functions H3.32|l of the canonical parameters , which is the usual compatible 
measurement of the observables Xk. The domain of this efficiency, obviously, 
coincides with the domain O C M" for which x(^) < x £ 0. It has been 
established that the converse is true in the following sense. 

Let the estimators 9k (i-e. the results of a measurement) have averages 9k{a) 
and covariances Rik{a) that are differentiable in a certain domain, and let the 
matrices R = [Rik (a)], D = [89 /da''] satisfy the conditions 

d{R-^D)l/da' = d{R-^Dl)/da*' (3.34) 

(the regularity conditions). We can then introduce the canonical parameters 
^fc _ >f'=(Q;) defined uniquely by the derivatives d>c^ /da'' = {R^^D)\, if we put 
>ir'^(ao) = for a fixed ao. It can easily be verified that for a family of density op- 
erators S{a) of canonical form H3.31|l . with k'' = >t*'{a) differentiable functions 
possessing a nonzero Jacobian, the regularity conditions are met in an efficient 
measurement at 9k {a) = dhi{x{>c{a)) / dx^ such that Rik — Gik{>c{a)) and 
(i?~^G)^ = dx'' /da^ . Proof of the converse assertion that under the regular- 
ity conditions the global Helstrom efficiency comes into play only for canonical 
families H3.31|l is given in Section 13.2.41 for a more general situation involving 
complex variables. 

Hence we have proved the following 

Theorem 10 Under appropriate regularity conditions, inequality (|3.14|l is trans- 
formed into an equality in a certain domain O C M" if and only if the fam- 
ily of density operators S{a) has the canonical form (|3.14(l . where Xk, k — 
1, . . . , 71, are commutative Hermitian operators in Ti, and the canonical param- 
eters , k — 1, . . . ,n, are functions of parameters a defined by the equations 

dlnx/dx'' = 9k{a), k = l,...,n. 

3.2.3 The Theorem Regarding the Canonical Density Op- 
erators 

Suppose that in a certain domain O C C" the unbiased estimators 9k possess 
averages 9k{a, a) and covariances Rik{a, a) that satisfy the regularity conditions 
H3.34|l . to which we adjoin the analyticity condition 

^R^'D = 0. (3.35) 

Here we can introduce, as we did in Section 13.2.11 canonically conjugate pa- 
rameters = I3^{a) via the equations d0^ /dc^ = (i?~^Z3)^ and conditions 
(5^(aQ) = for a fixed ao G O with the functions /S*^ (a) being analytic in view 
of conditions (|3.35|) . 
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Theorem 11 Under the formulated regularity conditions, inequality (|3.17|l is 
transformed into an equality in a certain domain O C C" if and only if the 
family {S{a,a),a S O} has the canonical form (|3.18l) . with So = 5(ao,ao) for 
an ao £ O, the operators Xk, k — 1,. . . ,n, simultaneously possess in Ti. the 
property of the right proper decomposition of unity 

I ^ I M{dx), XkM{dx) ^ XkM{dx), a; = {xfc}eC", (3.36) 



and the parameters , k = 1, . . . ,n, are analytic functions f3^(a) determined 
by the equations 

d\nx/dj3 =9k{a,a), a£0. 

Optimal estimation is then reduced to a quasimeasurement of the non-Hermitian 
operators Xk described by the decomposition of unity H3.36|l , while the minimal 
mean square error is determined by the covariance matrix 

R,k = Tr [S{x, - e,){xk - Ou)*]. (3.37) 

Proof. Sufficiency is proved in the same way as in Section l^.2.11 Employing the 
fact of invariance oft he right bound H3.17|l under the analytic transformations 
a ^ /3, we select for the variables determining this bound the parameters 
p'^ of the family of density operators H3.18|l . The elements dOi/dfi'^ of matrix D 
then coincide with the elements (|3.22|l of matrix H if we allow for the fact that 
9-1 = d\nx/ dff . Since according to H3.36|l the operators Xk are commutative, 
XiXk = J XiXkM{dx) = XkXi, we have 9k = /^^ and Hik = Sik, where the 
/ij. are the averages of the Xk, and the Sik are the covariances (13.24(1 of these 
operators. Hence, inequality H3.17|l assumes the form R > S. What remains 
to be proved is that the measurement described by the decomposition of unity 
(|3.36|l leads to an estimation for which R = S even when the operators are not 
commutative with the respective conjugates: XiX^ ^ x\xi (which occurs when 
decomposition (|3.36(l is nonorthogonal) . To do this, it is sufficient to allow for 
the representation 

x^^ I xM{dx), Xixl = I x^XkM{dx), X G C", (3.38) 



which is obtained by integrating the equations in (|3.36|) and the adjoint equation 
M{dx)x^ = XkM{dx). Thanks to H3.38|l the covariances 

Rrk = j{x,- e,){xk - 9k)Tv [SM{dx)] (3.39) 

of the estimators Ok obtained as a result of a quasimeasurement of operators Xk 
coincide with the covariances Sik of these operators, which proves the efficiency 
of this quasimeasurement for the density operators (|3.18l) . The proof of the 
converse of Theorem ^2 follows from the derivation of inequality (|3.17l) and will 
be discussed in Section lB.2.41 ■ 
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3.2.4 Discussion and an Example 

Thus, the condition of (right) efficiency requires the existence of commutative 
operators possessing a joint right spectral decomposition and playing the role of 
sufficient statistics, which it is natural to call right efficient. Here it is sufficient 
to restrict the discussion to the operators in the minimal subspace generated by 
the regions S{(3,j3)H with the density operators S{/3,j3) for all /3(a) G C" for 
which a Cz O. Even if we consider only the real values of parameters 6 {a, a), 
optimal estimation can be described by non-Hermitian and noncommutative 
(with the conjugate) operators of the right-efficient statistics and, therefore, 
may not be Helstrom efficient. However, estimates that are Helstrom efficient 
correspond, according to Theorem 1101 to the particular case of right efficiency 
where the Xk are Hermitian. If the operators Xk in (|3.18() are non-Hermitian 
but commutative with the conjugate operators, the right-efficient estimates also 
coincide with complexified estimates, which are Helstrom efficient. However, 
commutativity XkX* = x*Xk, may not occur either. 

Example 12 Let Xk — (^j,(a), where the </?j, are entire functions C —t C, and 
let a = {fli, z = 1, . . . , r} be the annihilation operators satisfying the commuta- 
tion relations 

CLiCLk ~ CLkOi ^ Q, didl ~ aldi ^ S'li. 

Show the right efficient measurement for the parameters 0k ~ dlnx/ djf^ of the 
density operators (|3.18(l . 

It is well-known that the operators d have right eigenvectors |a) € 7i, a € 
C, that define the nonorthogonal decomposition of unity 

1 = J |Q!)(a| ]^7r~"'^dRcaj:dImQ;i, di\a) = ai\a). 

i—l 

It is obvious then that the operators x = ip{d) have a right proper decomposition 
of unity H3.36|l . where 

M{dx)=dx / S{x — (p{a)) \a){a\Y\'^~^'i^sctidlmai 

i=l 

{Ax is the Lebesgue measure on C", and 5{x — (p) is the Dirac delta function). 

— k 

Hence, optimal estimation of the parameters 6k — dlnx/dp of the density 
operators (|3.18|) aX x — (p{d) is right efficient and can be reduced to a coherent 
measurement and estimation oi 6 = ip{a) by the result a. In the particular case 
where (p{a) is a linear function and So is a Gaussian state this fact has been 
established in jEj- 

Note that along with right and left lower bounds one can consider other 
combined bounds via the factorization 6 — 6^+9^ by appropriately defining 
the right derivatives with respect to 0+ and the left derivatives with respect to 
An interesting question arising in this connection is whether the class of 
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efRcient estimations is exhausted by the estimations for which at least one such 
bound is attained. 

Let us now consider the (right) efficiency of estimating the parameters /S^ of 
the canonical families ij^SJ. The inequality (|3.17|l corresponding to this case 
with 0'' ~ p'' has the form R > H~^, where H is the matrix of derivatives 
(|3.22|l . Without loss of generality, we can assume that Tr {xkSo) = 0. 

Theorem 13 The inequality R > H^^ transforms into an equality if and only 
if the operators Xk in H3.18|l possess a right joint decomposition of unity ()3.36|l . 
the generating function of the moments ()r{.19|l of these operators in state So 
is Gaussian, xiPiP) = exp{/3*i?ifc/3''}, with Hik independent of f3 and /3 and 
linear functions y'^ = {H~^)^^Xi of the results Xk of joint quasimeasurement of 

-k 

observables Xk are selected for the estimators (3 . 

Proof. Sufficiency of the above- formulated conditions for the existence of right- 
efficient estimation is obvious: the fact that matrix H coincides with the covari- 
ance matrix S of operators Xk implies that the covariance matrix R = H~^SH~^ 
is equal to H^^. Necessity follows from the necessary conditions of right effi- 
ciency in Theorem llll according to which the family S{f3, (3) must have the form 

S{(3, p) = ^-^e'^'-y"' Soe'^'^y" , (3.40) 

where ^ = Tr [S'oe^*j'''e^'=^'"], p'' = d{9k\nij) /d9k, and the operators y'' pos- 
sess the joint right unity decomposition in the sense: 

/ = y M{dx), fM{dy) = y^'Midy), y = (y'^) G C". 

Comparing (|3.3HI with H3.18|l . we conclude that 9ky'' = p'^Xk, whence 

9k = Hk^~P\ i^{9,9) = x{l3,~P) = ~P"H,kp\ t = {H-^f'x,. 
The proof of Theorem II 31 is complete. ■ 

Proof of Inequality ()3.17p 

1. We start with the one-dimensional case. Let x be a non-Hcrmitian operator 
in TL for which 

Ti[xS[a,a)]^9{a,a). (3.41) 

Differentiating H3.41|l with respect to a and employing definition (|3.15() and the 
normalization condition Tr S{a^a) — 1, according to which Tr (Sh*) — 0, we 
obtain 

d9/da ^Ti [S{x~9)h*]. 
Since the covariance Tr [S{x — 9)h*] obeys the Schwarz inequality 

|Tr [S{x - 9)h*]\^ < Tr [S{x - 9){x - 6l)*]Tr {Shh*), (3.42) 



3.2. UNCERTAINTY RELATIONS AND EFFICIENT MEASUREMENTS71 



which reflects the fact that the determinant of the 2-by-2 covariance matrix 
Tr [Shih*j,)^ i = 0, 1, with /ig = {x — 0) and hi — h, is nonnegative, we can write 

Tr [S{x ~ 0){x* -e)> \de/da\^/TT {Shh*). (3.43) 

This inequality, obviously, specifies the lower bound on the variance of the 
estimation of parameter = 9 (a, a) in the class of ordinary measurements 
described by normal operators x. But since the normality condition, xx* — x*x, 
was not used in deriving H3.43() . this bound is the lower one for the variance of 
any estimators 9 obtained as a result of arbitrary generalized measurements 
described in H by decompositions of unity I — J M{dx), x G C that may be 
nonorthogonal. Indeed, the nonnegative definiteness 

{x-x)M{dx)ix-x)* >0 (Af>0) (3.44) 

implies 

J \x- 9\^M{dx) >{x- 9){x - 9)*, (3.45) 

where x = J xMidx), and 9 — Tv (Sx). Taking the mathematical expectations 
of both sides of (|3.44l) , allowing for the fact that the variance R of the estimator 
^ = a; is equal to Tr 5 x J \x ~ 9\'^M{dx), and combining the result with (|3.43|l . 
we find that 

i? > Tr [S{x - 9){x = 61)*] > \Df/H, (3.46) 

where D = 89/ da, and H = Tr (Shh*). This proves inequality H3.17|l for the 
one-dimensional case. 

2. The equality in (|3.46l) occurs if, first, the averages of both sides of H3.45|l 
coincide and if, second, the Schwarz inequality transforms into an equality. 
Actually, the first requirement establishes an equality in H3.44|l . Specifically, we 
have the following. 

Lemma 14 Suppose that the ranges S{a, a)Ti. of the density operators from 
a family {S{a,a), a S O} generate the entire space TL. Then the fact that 
Tr (5^4) = for every nonnegative definite operator A in Ti. and all a € O 
implies that A = Q. 

Proof. It is sufficient to prove that in 7Y there is no vector x of the form x = 
S^^^ip for which (xl^lx) 7^ 0. But this follows from the well-known inequality 

Tr (S^^^AS^^^) > (V'|S'^/^AS'i/2|^)^ 

which is true for every nonnegative A at (jp \ ip) — 1. 

Applying this result to the operator A that is equal to the difference be- 
tween the right- and left-hand sides of H3.45|l . we find that under the lemma's 
hypothesis the equality in H3.45|l occurs only if 



(x - x)M {dx){x - x)* = 0, i.e. xM{dx) = xM{dx). 
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This proves that right-efficient estimation in a certain region O B a exists if 
there is an operator of minimal sufficient statistics, x, possessing a right proper 
decomposition of unity in the subspace generated by the subspaces S(a,a)T-L. 
In the real S M, such an operator x is obviously Hermitian. 

The second requirement for equality to occurs in (|3.46(l is equivalent to the 
condition of linear dependence, sh — XS{x — 9), where A — D/R if the first 
condition for equality in (|3.45() is met. Extending this condition over the entire 
region O 9 a in which the analyticity condition (|3.25|l . dX/da = 0, is assumed 
to hold true, we arrive at the equation 

dS/da = XS{x - 0), dS/da = (i - 9)*SX 

in 5" = S{a, a). Its solution combined with the boundary condition S{ao, ao) = 
So has the canonical form (|3.18|l . where /3 (a) = A(q;) is an analytic function, 
and X is the operator of right-efficient statistics. This proves that in the one- 
dimensional case the existence of right-efficient estimation requires that the 
density operators S{a, a) be canonical. This condition is formulated in Theorem 
111! For the real case, x* — x, this fact also proves the necessity in Theorem 1 101 
3. The multidimensional generalization can be carried out if for x — 6 
and h we take the sums (a;* — 0^)fi^ and h^S, , where rj^, i ~ l,...,m and 
, A: = 1, . . . , n, are complex numbers. If we allow for the fact that here 

Tr [S{x - 9)h*] = ri,{de'/da'')C\ 

then from (|^^ at C'' = [H-^D'^f'^ rj^ we arrive at the inequality 

R'%riu > Tr [Six^ - 6^){x'^ - O^Yr^.r^^] > (DH-^ D^y^f/rVk 

valid for an arbitrary a;* for which Tr (Sx^) — 6**. Putting = / x'^M{dx), 
where / M{dx) ~ I, x £ C™, is the decomposition of unity describing the 

estimator 9 — x^, and applying inequality (|3.45|) with x — x^fj^ and 9 = 9^fj^^ 

we obtain for the matrix R of covariances of 9 the first inequality in (|3.46|l . 
which in view of the arbitrariness of ?7j yields H3.17|l . 

Inequality H3.46|l transforms into an equality at a e O only when x^M{dx) = 
x^M{dx) and dS/da^ — XkiS{x^ — 9^), where Xik — {R^^D)ik, whence, if we 
allow for the regularity conditions A^fc, we arrive at (|3.18l) . ■ 

3.3 Optimal and Covariant Estimation of Wave 
Patterns 

In this section we will consider the necessary and sufficient conditions for the op- 
timality of measuring sound and visual patterns by the criterion of mean square 
error in parameter estimation and by the maximal intensity criterion. To avoid 
substantiation of the operator integrals involved in the discussion this is done 
in [37], we interpret them as operator- valued Radon measures. The solution 
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to the optimal measurement problem will be found for homogeneous families 
of wave patterns for which it coincides with optimal covariant measurements of 
the corresponding parameters of quantized fields, with the latter measurements 
introduced in jl3| . 

3.3.1 Optimal Measurements 

The problems of optimal estimation of continuous wave parameters constitute 
essentially multialternative problems with an infinite-dimensional solution space 
(or manifold) X. without loss of generality, we can assume that the information 
parameter space 9 coincides with X equipped with measure dA. Let us assume 
that a wave signal, which in general is described by a density operator S, depends 
in a continuous manner on real- or complex-valued random parameters 9 = 
{9i, . . . , 9„), S = Se, having a given a priori distribution P(d9). The deviation 
of the estimate x ^ X from 9 is penalized by an integrable cost function Cx{9) 
of the form, say, {x — 9^ . On X we must find an optimal quasimeasurement 
that (a) is described by an operator- valued measure M{dx), (b) determines the 
decomposition of unity in the Hilbert space 7i, and (c) minimizes the mean 
estimation cost 



where fig{dx) = Tr M{dx)Sg is the observed intensity distribution on X for a 
given 9, and Rx — J Cx{0) SeP{d9) is the operator of the mean cost x £ X. We 
will now formulate the necessary and sufficient conditions for the optimality of 
solution M° to this extremal problem, which in were introduced to estimate 
the parameters of quantum states. This will be done in a manner similar to that 
of Theorem O 

Theorem 15 The lower bound 



is attained on measure M° if and only if for almost all x £ X there exists a 
minorant operator A° < R^ such that 



The operator A° is a trace class operator, or Tr A° = (A°, I) < oo, that deter- 
mines the solution to the duality problem 





(3.47) 



{Rx - A°)M°{dx) =0 VxeX. 



(3.48) 



sup{(A,/)|A<ii'„ xeX} 



(3.49) 



A 



for which conditions (|3.48|l are also necessary and sufficient (if we allow for the 
fact that M° > and J M°{dx) = E). 
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Proof. For a proof of this theorem as well as for the existence conditions for a 
solution see P7j . 

Allowing for the fact that the operators M{dx) can be decomposed into 
operators of the form |x2:)(Xa;l dA(a;), where the Xx ^'^^ the generalized elements 
of space 7i, we find that the problem of optimal estimation of wave parameters 
will be solved if and only if we can find a family of reference waves, {Xx\ 
satisfying the completeness condition 



|xJ(xJdA(a:)=/ (3.50) 

and a Hermitian operator A for which 

i?, =A>0, (i?, -A)x, = 0, xeX. (3.51) 

Note that, in contrast to problems of signal discrimination, in problems of 
parameter estimation the commutative case RxRx' = Rx'Rx, which can be 
reduced to the classical case, is of no practical interest and will not be discussed 
here. 

The solution of problem 13.51|l poses no fundamental difficulties in the case 
of a single unknown real- valued parameter 0{x = M^) and a quadratic penalty 
function 

Cxi9) - {x-Of. 
The mean estimation cost operator 



Rx = J (x- eysgP{de) 

in the case of (|3.51|l can be represented, via three Hermitian operators 



R^^'> = J e''SgP{d0), A: ==0,1, 2, (3.52) 



in the form 



= (i - a;)i?(°) {x-x) + - xR^°^x, 

where x is an operator satisfying the equation 

ii?(")+ii'(°)i = 2i?(^). (3.53) 

and for Xx take the complete orthogonal system of generalized eigenvectors 
determining the spectral decomposition of the Hermitian operator x, 



ix-x)x.^O,x^Jx\xJiXx\dx, 

the conditions H3.51II are satisfied in an obvious manner: 

(x - a;)i?W {x-x)> 0, {x - (x - x)x^ ^ 0. 
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Thus, the solution of the parameter estimation problem by criterion H3.51|l is 
reduced to measuring operator x satisfying H3.53|l . The result of such a mea- 
surement, X, leads to the minimal error (c) = Tr A° equal to the a posteriori 
variance 

^2 ^ Tr _ iR(0)^)_ 

■ 

As an example, let us consider the estimation of the amplitude of a coherent 
signal of known shape received against a background of Gaussian noise. The 
density operator of the corresponding mode has the Gaussian form 

S{e) = J |a)(a|n~^exp|- ^"~^^ | Tr'MReQdlma, (3.54) 

where 9 is the amplitude, which assumes real values, G M^. We assume that 
amplitude 9 has a Gaussian prior density 

p{d) = {2TTs)-^^^cxp{^9^/2s}, 

where s is the prior variance, (0^) = s. It is then easy to find, via the formulas 
of Gaussian integration, the operators H3.52|l . which define the mean decision 
cost operator 

R.^(x '-^q) S (x + + 

V 2s + n + l/2^ J V 2s + n + l/2^ J 2s + n + l/2 

Here Q = jRea|Q!)(a|7r^^ dRea dim a is the operator of the "coordinate" of 
the harmonic oscillator representing this mode, and 

S= [ |a)(a|((2s- + n)n)-i/^exp(-^^^-^^^l7r-idReadIma 
J [ 2s + n n } 

is the density operator, with J Sgp{0) d9 — R^^\ Hence, optimal estimation 
of the amplitude of a Gaussian signal is reduced to measuring the coordinate 
operator Q, whose result q determines the optimal estimate 

X = 2sq/{2s + n + l/2) 

with a minimal mean square error 

0-2 = 2s(n + 1/2) (2s + 7l + 1/2) . 

3.3.2 Miltidimensional Optimal Estimation Problem 

In the case (n > 1) even for the quadratic quality criterion 

n 

C.{0)^Y.^x,~9,f 
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the general solution to problem (|3.51l) is unknown. Only in the particular case 
where the operators x — {xj} obeying (|3.53() . with 

r'' = Rj^ J e^jSgp^e) dOi... d^,,, fc = o, i, 2, 

commute with each other [x ), optimal estimation is reduced, obvi- 

ously, to joint measurement of these operators. 

In general, a good estimate of parameters 9j can be obtained by an indirect 
measurement of noncommutative operators {xi} (see Section l3.2.2|l . However, 
this estimate is not necessarily optimal, even if the indirect measurement is 
ideal. 

For an example let us take the complex-valued one-dimensional case {X = 
C^), which can also be interpreted as the real- valued two-dimensional: 

C^(6») = |x - = (Re(a; - d)f + (Im(.T - ejf. 

An exact solution to the problem of optimal estimation of a single parameter 
9 has been obtained in ::30_ for this case of a quadratic penalty function, a 
Gaussian state Sg; of the form l|3.54|l . with 9 E C^, and a Gaussian prior 
probability density 

p{0)^s-'eM-\d\Vs}, s = {\9\'). (3.55) 

The density p{9) is normalized with respect to 

dX{9) TT-^dRe9dlm9. 

In this case, by the standard formulas of Gaussian integration, we can easily 
find the mean decision cost operator 

R. = (x* - —^-A*) s(x~ ^) dXia) 
\ n + s + L J \ s + n J 

is the density operator S — j Sgp{9) d\{9). Assuming that 

s -\-n + l \ s } / 

where |a), a = (1 -I- (n -|- l)/s)x, are coherent vectors, and c = s/(s -I- n -|- 1) 
is a coefficient that can be found from condition (|3.5U|) if we allow for the 
completeness of coherent states 

y" |a)(a| dA(a) = /, 

and allowing for the equation A\a) = a|a), we find that conditions (|3.51|) are 
met: 

(x* -cA*)S(x-cA)>^, {x* ~cA*)S{x-cA)\c-^x)^Q. (3.56) 
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Thus, optimal estimation in the one-dimensional complex- valued quadratic- 
Gaussian case is reduced to a coherent measurement describing an ideal proper 
indirect measurement of the annihilation operator A whose result a determines 
the estimate 

s 



with a minimal error 

n s(n + 1) 

s + n + 1 

This error is equal to the error of the appropriate classical problem of estimation 
in a Gaussian linear channel with a noise intensity of n + 1. The quantity n 
(the mean number of the noise quanta) is determined by the noise proper in the 
wave channel, while the unity corresponds to the "effective noise" thanks to the 
inaccuracy in the ideal indirect measurement. The measurement noise of unit 
intensity can be interpreted as the noise produced by an ideal wave amplifier or 
as the noise produced by an ideal optical heterodyne. 



3.3.3 Optimal Measurement of Wave States 

Let X be a set of hypotheses concerning the states of a wave field, and {Rx, x G 
X} the respective decomposable family of density operators Rx = ®„ i?!"'' in 
the Hilbert space H — 0„H^"^ Then the set of measurements described 
by operator-valued measures M on X{M{-) > 0, / M{dx) — I) of the de- 
composable form M{dx) — 0„ M'^"^ dx is sufhcient. The optimal strategy is 
described by the family of operator-valued measures M(") on X, i.e. M(")(-) > 
0, / M(")(da;) = defined independently in W^") for every n by the condi- 
tions 

= A("))M(")(dx) = 0, < A(") Vz € a: (3.57) 

(the maximum intensity criterion). Here A^"^ are Hermitian trace class opera- 
tors in TY^"' that are nonnegative (for i?^"'' > 0) and can be represented in the 
form 

A^ i?(,")M(")(da;)i?i"). 

Problem (|3.57|) is incomparably simpler than the general problem of optimal 
discrimination of a family {Rx} and for every n has a finite-dimension of space 
7i^") if the signal space C is finite-dimensional. In what follows, the index n 
will be dropped. 

Let Ux — RxH be the range of values of operators Rx in H, let U{dx) be 
their algebraic sum for all x G dx, and \ei U — J U{dx) be the sum of all the 
subspace Ux C 7i. Each nonnegative operator R^ can be represented in the form 
Rx = V'kV'xi where il)^ is the operator hovalAx vaiolA. The following conjectures 
are multidimensional generalizations of the appropriate assertions of Theorem 
|31(for a discrete set X). 
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Theorem 16 (1) Subspace U is sufficient for solving problem (|3.57|l . Every 
operator A satisfying conditions A > R^ & X for R^ > has an inverse A~^ 
in hi. 

(2) The solution to problem H3.57|l in the sufficient space lA has the form 



and jjL is a measure on X whose values fi (dx) are nonnegative operators in 
defined by the conditions 



[Ix is the identity element inlAx)- 

(3) // the subspace hi (dx) does not intersect with the sum hi {dx) of all the re- 
maining subspaces hiy, y ^ dx, the operator jj.{dx) is strictly positive in Ux and 
is defined by the condition ipxA~^'ipx — Ix, x G X . 

3.3.4 Waves with Group Symmetry 

Equations (|3.59|) are considerably simpler than Eqn. (|3.57|) since the dimension- 
ality of each operator equation in H3.59|l is equal to rank r{Rx). For the case 
where r{Rx) = 1 the solution has been found jjl] under the condition that the 
square root of the correlation matrix [V'xV'j^] has equal diagonal elements. An 
analog of this condition in the general case where r{Rx) > 1 is the condition of 
group (say, cyclic in 12 ) symmetry of the family {Rx}- 

Let X be a homogeneous set with respect to a group G, that is, group G 
acts on X transitively, and let U{g), g € G,he & unitary representation of G in 
TC. The family {Rx, x e X} is said to be G- homogeneous (or G- invariant) if X 
is a homogeneous set with respect to group G and U{g)RjxU*{g) = Rx- 

Let G be a finite compact or locally compact group, d^ be the left Haar 
measure on G, the family {Rx} be homogeneous and continuous in U{g). The 
following conjectures are true: 

Theorem 17 (1) The sufficient space hi is a subspace in Ti. cyclically generated 
by the family {U{g), g € G} overhio — Rx^Ti., where Xq is any element belonging 
to X . The operators Rx in hi can be represented in the form 



where U{g) is a subrepresentation induced in hi C H, ip an operator from Uq 
into hi, and Gx the left coset Gx = {g ■ gxo — x} over the stationary subgroup 
Go = GxQ of element xq- 

(2) The optimal strategy (|3.58|l has the covariant form 



M{dx) = A^^^p^fi{dx)^px^^^, where A 




(V'^A-Vx - Ix)Kdx) = 0, ^*xA-^^x >lx yx€X 



(3.59) 



Rx^U{gyjrU*{g) VgeG, 




G(dx) 




U*{g)Ag, 



(3.60) 



3.3. OPTIMAL AND COVARIANT ESTIMATION OF WAVE PATTERNS79 



where G{dx) = Uaieda; union of the Gx over all x £ dx, 

A= (y Uig)ibfirU*{g) d<?)'^' 
is the G-invariant, and fl a non-negative operator in Uq satisfying the conditions 

(V*A-V-^) A = 0, V'*A"V<^- (3.61) 

(3) // the representation U{g) in hi is topologically irreducible, then operator A 
is a multiple of the identity element I of space IAq : K = \I and operator fi is 
proportional to the proper projector tt of operator R — ■ip*'il) corresponding to its 
maximal eigenvalue X : jj, = ^tt, {R — A)7r = 0. 

The proportionality factor can be made equal to unity by appropriately 
renormalizing Ag. For the particular case where G is finite and its action on X 
is effective this result was obtained earlier in [21]. If G is an Abelian group, the 
case is trivial and can be of no interest. 

(4) Let Uu]{g), w G ri, he the field of nonequivalent irreducible representations 
Uui{g) in space Ti^, md dto the Plancherel measure. Then H3.61(l can be repre- 
sented in the form 

[ TrnjRu.fi)'^^ duj ^ fi, [ Tr„Ji?^A)-i/2i?^ dc^ < /, (3.62) 
Jn Jn 

where R^ = J f(g)Ui^{g) dg is the Fourier transform of the operator correlation 
function f{g) — ip*U*{g)'ip. If the family {C/w(')} ^•s discrete and dw is the di- 
mensionality of representations Ui^{g) {formally, ifTi.^ is infinite- dimensional), 
then conditions H3.62|l assume the form 

TruAR^f^Y'^duj ^ fl, TvH^{R^fir^'^Ruduj<i. (3.63) 

If U[) is onc-dimcnsional, (|3.62l) and H3.63|l lead us to the following solutions: 

Tvn^Rl!^ dc.)', /i = (5]Tr«^i?i/2 dc^)'^', 
o 

which were found in ^ (that is, the case of group symmetry for pure states 
is "equidiagonal" ) . The rank of operators R^^ determines the multiplicity of 
representations Uui{g) in the representation U{g) in U. 

(5) Suppose that the multiplicity of representations U^j^g) in the representation 
U{g) is unity. Then the operators R^j are one-dimensional, i?^ = V'li'X'V'aj- '^nd 
Eqs (|3.62|l and H3.63|l assume the form 

( y" S'i^ duj/ c^ - i^fi ^ 0, ( ^ 5"^ duj/ c^ ~ i^^fi^ 0, 

where S^ = Trn^Ruj, and = [Trj^„(S'„/i)]^/2. 
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In particular, if all Suj are commutative, then operator fi is a multiple of 
the proper projector tt of operators Si^, which corresponds to the eigenvalues X,^ 
with maximal 

(or ^= (^J2^l/^ )■ A = A^TT, (5^^ - Aw)^ = 0. 

3.3.5 Application to a Group Symmetry and Indetermi- 
nate Phase 

To apply the above results to the case of a decomposable G-homogencous family 
of density operators Sx = ®nRx it is sufficient to supply all the spaces and 
operators in H3.57(l - (|3.62(l with an index n and then sum over n. In particular, if 
the representations U'''^\g) in 7Y'"' are n-th tensor powers of the representation 
of U{g) in C, then the solution of the problem of optimal recognition of audio 
and optical fields is reduced to finding the irreducible representations U^{g) 
contained in U^^\g). The operators determining H8.62|l and H3.68|l are 

- j r(")(5)f/.(5)d5 

where 

For example, if the states Sx are Gaussian, the family of signals {'Pxi^ ^ ^} 
is G-homogeneous: U{g)ipgx = V^j ^iid the correlation noise operator L (or N) 
is G-invariant: {/(5)Lt/*((7) = L, then the family of the i?^"^ operators is also 
G-homogeneous with respect to the appropriate tensor powers U^"\g) of the 
representation U{g) in the subspace U generated by vector ip = ip^^ for a certain 
xq € X. In this manner we can find the exact solution to the following problems: 
resolution of several nonorthogonal partially coherent signals or fields that form 
a homogeneous family of permutations with respect to a certain group (sym- 
metric groups S{r) and their subgroups), estimation of the time lag of pulsed 
signals and the carrier frequency in quasiperiodic signals (cycle Z groups), joint 
measurement of the duration and the frequency of a wave packet (the symplec- 
tic group), separate or joint measurement of momenta and position of quantum 
systems (and ensembles of such systems) with r degrees of freedom (the Z{r) 
groups), detection of photon polarization and electron spin (the SU{2) group), 
detection of complex signals and fields with equal intensities of rank r against 
a thermal background (the SU{r) groups and their subgroups), and the like. 



Afterword 



Optimal wave tomography as mathematical wave-pattern recognition theory, 
emerged in the 70 's first in connection with the problems of optimal estimation 
and hypothesis testing in quantum theory, is a new pattern recognition theory. 
The key problems in this theory, mathematical design of the optimal dynamic 
analyzer discriminating between a given family of pure or mixed a priori un- 
known wave patterns, is thoroughly studied here. Like the problem of optimal 
quantum measurement it cannot be tackled by the methods of classical math- 
ematical statistics which is not concerned how these date should be obtained 
from the physical wave states in an optimal way. We extended here the results of 
optimal quantum measurement theory obtained in |2H|~|Sni into the direction of 
wave, rather than particle statistical estimation and hypothesis testing theory, 
naturally including into the wave tomography not only quantum matter waves 
but also classical wave patterns like optical and acoustic waves. The developed 
methods are applied to the problems of mathematical design of optimal wave 
analyzer discriminating the visual and sound patterns. Thus, Hilbert space and 
operator methods, developed first in quantum theory of optimal quantum mea- 
surement, are found to be equally useful in the classical wave theory where the 
possible observations are restricted to only intensity distributions of waves, i.e. 
when the wave states are not the allowed observables, as they are not the ob- 
servables for individual particles in the quantum theory. It has been shown that 
all the attributes of quantum measurement theory such as complementarity, en- 
tanglements or Heisenberg uncertainty relations have also an exact reflection in 
the wave pattern recognition theory. 
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